qwen

Qwen: Qwen3 Coder Flash

Qwen3 Coder Flash is a text-in, text-out model from Qwen with a 1-million-token context window and a 65,536-token output ceiling. It supports tool use, which makes it viable for agentic coding workflows, but it lacks native reasoning mode and has no confirmed structured output support. Input is text only, so multimodal tasks are out of scope. At $0.195 per million input tokens and $0.975 per million output tokens, it sits in the budget-to-mid tier for coding-focused models. The output price is the more meaningful cost driver for long code generation runs, so high-volume users should model that carefully. There is currently no independent benchmark coverage, meaning quality relative to competitors is unproven. Developers who need a large context window for codebase-scale tasks and want to keep costs low may find it worth testing, but anyone requiring validated performance data before committing should wait for third-party evaluations to emerge.

Query via API → View on qwen → Estimate cost

Quality Score

94/100

price + capability + benchmarks

Input Price

$0.20

per 1M tokens

Output Price

$0.97

per 1M tokens

Context Window

1,000,000

tokens

Model ID: qwen/qwen3-coder-flash
Vendor: qwen
Tokenizer: Qwen3
Input Modalities: text
Output Modalities: text
Max Output: 65,536 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

Similar models

qwen

Qwen: Qwen3 Coder Flash

Similar models

Qwen: Qwen3 Next 80B A3B Instruct

Qwen: Qwen3 Coder Next

Qwen: Qwen Plus 0728

Qwen: Qwen-Plus

Qwen: Qwen3 Coder 480B A35B

Qwen: Qwen3 235B A22B Instruct 2507