Qwen: Qwen3 Coder Flash
Qwen3 Coder Flash is a text-in, text-out model from Qwen with a 1-million-token context window and a 65,536-token output ceiling. It supports tool use, which makes it viable for agentic coding workflows, but it lacks native reasoning mode and has no confirmed structured output support. Input is text only, so multimodal tasks are out of scope. At $0.195 per million input tokens and $0.975 per million output tokens, it sits in the budget-to-mid tier for coding-focused models. The output price is the more meaningful cost driver for long code generation runs, so high-volume users should model that carefully. There is currently no independent benchmark coverage, meaning quality relative to competitors is unproven. Developers who need a large context window for codebase-scale tasks and want to keep costs low may find it worth testing, but anyone requiring validated performance data before committing should wait for third-party evaluations to emerge.
- Model ID
- qwen/qwen3-coder-flash
- Vendor
- qwen
- Tokenizer
- Qwen3
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 65,536 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- text only
- Audio
- no
- Moderated
- no