head-to-head

xAI: Grok Build 0.1 vs Qwen: Qwen3.5-9B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

                    xAI: Grok Build 0.1   Qwen: Qwen3.5-9B
Vendor              x-ai                  qwen
Quality Score       100                   100
Benchmark Score     -                     40.0
Input Price         $1.00/M               $0.10/M
Output Price        $2.00/M               $0.15/M
Context Window      256,000               256,000
Max Output          -                     32,768
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio               -                     -

Benchmark Scores
ai_index            -                     41.2
ai_index_coding     -                     47.3
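
To make the price gap concrete, here is a minimal sketch that turns the listed per-million-token prices into a per-request cost. The function name `request_cost` and the workload of 10,000 input and 2,000 output tokens are assumptions chosen for illustration; real bills may also depend on caching, batching, or provider-specific fees that this page does not list.

```python
# Prices in USD per million tokens, copied from the spec table above.
PRICES = {
    "xAI: Grok Build 0.1": {"input": 1.00, "output": 2.00},
    "Qwen: Qwen3.5-9B": {"input": 0.10, "output": 0.15},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed per-million-token prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens per request.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

For that hypothetical request the listed prices work out to about $0.0140 for Grok Build 0.1 and $0.0013 for Qwen3.5-9B.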

Who wins by task?

Task                          xAI: Grok Build 0.1   Qwen: Qwen3.5-9B
SQL Generation                130                   144
Code Review                   126                   139
Code Completion               116                   129
Code Refactoring              127                   138
Bug Fixing                    130                   143
Unit Test Generation          121                   132
Code Documentation            125                   130
Regex Writing                 119                   126
CI/CD Pipelines               117                   126
Frontend Component Design     122                   131
Data Analysis                 124                   138
CSV / Spreadsheet Cleanup     127                   137
ETL Scripting                 122                   132
JSON Extraction               123                   139
Bulk Data Labeling            121                   132
OCR / Document Parsing        128                   134
Table Extraction from PDFs    128                   134
Long-Document Summarization   129                   137
Short-Form Summarization      115                   127
Blog Post Writing             118                   125

Scores combine capability match, benchmark data, and pricing for each task.
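
The task table can be summarized with a short tally. The sketch below copies the scores from the table and counts which model leads on each task; the helper name `summarize` is an assumption for illustration, and it only compares the published numbers without reproducing the scoring methodology.

```python
# (task, Grok Build 0.1 score, Qwen3.5-9B score), copied from the task table above.
TASK_SCORES = [
    ("SQL Generation", 130, 144),
    ("Code Review", 126, 139),
    ("Code Completion", 116, 129),
    ("Code Refactoring", 127, 138),
    ("Bug Fixing", 130, 143),
    ("Unit Test Generation", 121, 132),
    ("Code Documentation", 125, 130),
    ("Regex Writing", 119, 126),
    ("CI/CD Pipelines", 117, 126),
    ("Frontend Component Design", 122, 131),
    ("Data Analysis", 124, 138),
    ("CSV / Spreadsheet Cleanup", 127, 137),
    ("ETL Scripting", 122, 132),
    ("JSON Extraction", 123, 139),
    ("Bulk Data Labeling", 121, 132),
    ("OCR / Document Parsing", 128, 134),
    ("Table Extraction from PDFs", 128, 134),
    ("Long-Document Summarization", 129, 137),
    ("Short-Form Summarization", 115, 127),
    ("Blog Post Writing", 118, 125),
]


def summarize(rows):
    """Return (grok_wins, qwen_wins, widest_qwen_lead_task, narrowest_qwen_lead_task)."""
    # Margin is the Qwen score minus the Grok score, so a positive value is a Qwen lead.
    margins = {task: qwen - grok for task, grok, qwen in rows}
    grok_wins = sum(1 for m in margins.values() if m < 0)
    qwen_wins = sum(1 for m in margins.values() if m > 0)
    widest = max(margins, key=margins.get)
    narrowest = min(margins, key=margins.get)
    return grok_wins, qwen_wins, widest, narrowest


if __name__ == "__main__":
    grok_wins, qwen_wins, widest, narrowest = summarize(TASK_SCORES)
    print(f"Grok Build 0.1 leads on {grok_wins} tasks, Qwen3.5-9B on {qwen_wins}")
    print(f"Widest Qwen lead: {widest}; narrowest: {narrowest}")
```

On the listed scores, Qwen3.5-9B leads on all 20 tasks, with the widest margin on JSON Extraction (16 points) and the narrowest on Code Documentation (5 points).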

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok Build 0.1
MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-9B
Qwen: Qwen3.7 Plus vs xAI: Grok Build 0.1
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-9B
MiniMax: MiniMax M3 vs xAI: Grok Build 0.1
MiniMax: MiniMax M3 vs Qwen: Qwen3.5-9B
StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1
StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-9B