head-to-head

xAI: Grok 4.20 vs Qwen: Qwen3.5-122B-A10B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

                    xAI: Grok 4.20    Qwen: Qwen3.5-122B-A10B
Vendor              x-ai              qwen
Quality Score       100               100
Benchmark Score     61.5              54.1
Input Price         $1.25/M           $0.26/M
Output Price        $2.50/M           $2.08/M
Context Window      2,000,000         262,144
Max Output          -                 262,144
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio               -                 -

Benchmark Scores
ai_index            61.0              53.3
ai_index_agentic    -                 34.2
ai_index_coding     -                 75.4
eqbench             55.8              -
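
To put the per-million-token prices above into concrete terms, here is a minimal Python sketch that estimates the cost of a single request. The prices are copied from the table; the function name `request_cost` and the example workload (100,000 input tokens, 10,000 output tokens) are illustrative assumptions, not part of the comparison data.

```python
# Prices in USD per million tokens, copied from the spec table.
PRICES = {
    "xAI: Grok 4.20": {"input": 1.25, "output": 2.50},
    "Qwen: Qwen3.5-122B-A10B": {"input": 0.26, "output": 2.08},
}


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request at the listed per-million rates."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload (an assumption for illustration only):
# 100,000 input tokens and 10,000 output tokens.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 100_000, 10_000):.4f}")
```

For this assumed workload the listed rates work out to roughly $0.15 for Grok 4.20 and roughly $0.047 for Qwen3.5-122B-A10B. The gap narrows as the share of output tokens grows, because the output prices are much closer together than the input prices.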

Who wins by task?

Task                          xAI: Grok 4.20    Qwen: Qwen3.5-122B-A10B
SQL Generation                144               155
Code Review                   150               148
Code Completion               122               129
Code Refactoring              153               145
Bug Fixing                    154               156
Unit Test Generation          135               140
Code Documentation            141               133
Regex Writing                 127               129
CI/CD Pipelines               131               132
Frontend Component Design     131               137
Data Analysis                 136               152
CSV / Spreadsheet Cleanup     139               142
ETL Scripting                 142               139
JSON Extraction               123               143
Bulk Data Labeling            120               133
OCR / Document Parsing        135               139
Table Extraction from PDFs    135               139
Long-Document Summarization   154               143
Short-Form Summarization      119               128
Blog Post Writing             132               130

Each task score combines capability match, benchmark data, and pricing for that task.
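
The task table can be summarized by counting how many tasks each model leads. The sketch below is one way to do that in Python; the scores are copied from the table, while the `SCORES` and `wins` names are assumptions introduced for illustration. It only compares the published numbers and does not reproduce the site's scoring methodology.

```python
from collections import Counter

GROK = "xAI: Grok 4.20"
QWEN = "Qwen: Qwen3.5-122B-A10B"

# Task scores copied from the table: task -> (Grok score, Qwen score).
SCORES = {
    "SQL Generation": (144, 155),
    "Code Review": (150, 148),
    "Code Completion": (122, 129),
    "Code Refactoring": (153, 145),
    "Bug Fixing": (154, 156),
    "Unit Test Generation": (135, 140),
    "Code Documentation": (141, 133),
    "Regex Writing": (127, 129),
    "CI/CD Pipelines": (131, 132),
    "Frontend Component Design": (131, 137),
    "Data Analysis": (136, 152),
    "CSV / Spreadsheet Cleanup": (139, 142),
    "ETL Scripting": (142, 139),
    "JSON Extraction": (123, 143),
    "Bulk Data Labeling": (120, 133),
    "OCR / Document Parsing": (135, 139),
    "Table Extraction from PDFs": (135, 139),
    "Long-Document Summarization": (154, 143),
    "Short-Form Summarization": (119, 128),
    "Blog Post Writing": (132, 130),
}

# Count the tasks on which each model has the strictly higher score.
wins = Counter()
for task, (grok, qwen) in SCORES.items():
    if grok > qwen:
        wins[GROK] += 1
    elif qwen > grok:
        wins[QWEN] += 1
    else:
        wins["tie"] += 1

for name, count in wins.most_common():
    print(f"{name}: leads on {count} of {len(SCORES)} tasks")
```

On these numbers Qwen3.5-122B-A10B leads on 14 of the 20 tasks and Grok 4.20 on 6 (Code Review, Code Refactoring, Code Documentation, ETL Scripting, Long-Document Summarization, and Blog Post Writing), with no ties. Several margins are only one or two points, so a plain win count overstates how decisive some of these results are.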

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-122B-A10B
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-122B-A10B
MiniMax: MiniMax M3 vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs Qwen: Qwen3.5-122B-A10B
StepFun: Step 3.7 Flash vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-122B-A10B