head-to-head

StepFun: Step 3.7 Flash vs Google: Gemma 4 26B A4B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-23.

StepFun: Step 3.7 Flash Google: Gemma 4 26B A4B
Vendorstepfungoogle
Quality Score100100
Benchmark Score48.051.0
Input Price$0.20/M$0.06/M
Output Price$1.15/M$0.33/M
Context Window256,000262,144
Max Output256,000-
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.142.4
ai_index_agentic35.518.1
ai_index_coding61.664.9
eqbench-70.0

Who wins by task?

TaskStepFun: Step 3.7 FlashGoogle: Gemma 4 26B A4B
SQL Generation 152 154
Code Review 145 151
Code Completion 129 132
Code Refactoring 143 150
Bug Fixing 154 158
Unit Test Generation 138 141
Code Documentation 132 137
Regex Writing 129 130
CI/CD Pipelines 131 134
Frontend Component Design 135 137
CSV / Spreadsheet Cleanup 140 144
ETL Scripting 137 142
JSON Extraction 142 142
Bulk Data Labeling 133 133
OCR / Document Parsing 137 139
Table Extraction from PDFs 137 139
Long-Document Summarization 141 149
Short-Form Summarization 128 129
Blog Post Writing 129 132

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Google: Gemma 4 26B A4B Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Google: Gemma 4 26B A4B MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Google: Gemma 4 26B A4B StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash