Qwen: Qwen3 32B
Qwen3 32B is a text-only model from Qwen with a 131,072-token context window and a 16,384-token output ceiling. It supports tool use and reasoning, which makes it usable for agentic workflows and multi-step tasks. The specification list on this page marks structured output as supported, but the available data does not confirm it, so verify that before building pipelines that depend on it. At $0.08 per million input tokens and $0.28 per million output tokens, the model sits at the budget end of the reasoning-capable tier. Its blended benchmark score of 35.1 is drawn from only three benchmarks, covering coding, general knowledge, and a qualitative reasoning measure, so the performance picture is limited and other capabilities remain untested. Teams that prioritize low inference cost and can tolerate thin benchmark coverage should consider it; those that need validated performance across diverse tasks may want a model with wider benchmark documentation before committing.
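To make the pricing and limits above concrete, here is a minimal sketch of a per-request cost estimate. The prices, context window, and output ceiling are the figures quoted in the description. The function name `estimate_cost_usd` is hypothetical, and the check that input and output tokens share the context window is an assumption about how the limit is enforced, not something this page states. Reasoning tokens, if billed, would presumably count as output tokens, which is also an assumption.

```python
# Figures quoted in the description above.
INPUT_USD_PER_MTOK = 0.08    # USD per million input tokens
OUTPUT_USD_PER_MTOK = 0.28   # USD per million output tokens
CONTEXT_WINDOW = 131_072     # tokens
MAX_OUTPUT = 16_384          # tokens


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > MAX_OUTPUT:
        raise ValueError("output exceeds the 16,384-token ceiling")
    # Assumption: input and output tokens share the same context window.
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the 131,072-token context window")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # A long-context request: 100,000 tokens in, 4,000 tokens out.
    cost = estimate_cost_usd(100_000, 4_000)
    print(f"Estimated cost: ${cost:.5f}")
```

Under these assumptions, a request that nearly fills the context window still costs about a cent, which is the practical meaning of "budget end" here.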
- Model ID: qwen/qwen3-32b
- Vendor: qwen
- Tokenizer: Qwen3
- Input Modalities: text
- Output Modalities: text
- Max Output: 16,384 tokens
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: text only
- Audio: no
- Moderated: no