qwen

Qwen: Qwen3 Coder Flash

Qwen3 Coder Flash is a text-in, text-out model from Qwen with a 1-million-token context window and a 65,536-token output ceiling. It supports tool use, which makes it viable for agentic coding workflows, but it lacks native reasoning mode and has no confirmed structured output support. Input is text only, so multimodal tasks are out of scope. At $0.195 per million input tokens and $0.975 per million output tokens, it sits in the budget-to-mid tier for coding-focused models. The output price is the more meaningful cost driver for long code generation runs, so high-volume users should model that carefully. There is currently no independent benchmark coverage, meaning quality relative to competitors is unproven. Developers who need a large context window for codebase-scale tasks and want to keep costs low may find it worth testing, but anyone requiring validated performance data before committing should wait for third-party evaluations to emerge.

Quality Score
94/100
price + capability + benchmarks
Input Price
$0.20
per 1M tokens
Output Price
$0.97
per 1M tokens
Context Window
1,000,000
tokens
Model ID
qwen/qwen3-coder-flash
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text
Output Modalities
text
Max Output
65,536 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
no
Moderated
no

Similar models