head-to-head

StepFun: Step 3.7 Flash vs Qwen: Qwen3.5 Plus 2026-04-20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

StepFun: Step 3.7 Flash Qwen: Qwen3.5 Plus 2026-04-20
Vendorstepfunqwen
Quality Score100100
Benchmark Score48.0-
Input Price$0.20/M$0.30/M
Output Price$1.15/M$1.80/M
Context Window256,0001,000,000
Max Output256,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.1-
ai_index_agentic35.5-
ai_index_coding61.6-

Who wins by task?

TaskStepFun: Step 3.7 FlashQwen: Qwen3.5 Plus 2026-04-20
SQL Generation 152 133
Code Review 145 132
Code Completion 129 131
Code Refactoring 143 136
Bug Fixing 154 136
Unit Test Generation 138 124
Code Documentation 132 131
Regex Writing 129 119
CI/CD Pipelines 131 120
Frontend Component Design 135 122
Data Analysis 149 124
CSV / Spreadsheet Cleanup 140 133
ETL Scripting 137 128
JSON Extraction 142 131
Bulk Data Labeling 133 129
OCR / Document Parsing 137 131
Table Extraction from PDFs 137 131
Long-Document Summarization 141 137
Short-Form Summarization 128 123
Blog Post Writing 129 121

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5 Plus 2026-04-20 Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5 Plus 2026-04-20 MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Qwen: Qwen3.5 Plus 2026-04-20 StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash