qwen

Qwen: Qwen3 32B

Qwen3 32B is a text-only model from Qwen with a 131,072-token context window and a 16,384-token output ceiling. It supports tool use and reasoning, which makes it usable for agentic workflows and multi-step tasks. Structured output support is unconfirmed from available data, so verify that before building pipelines that depend on it. At $0.08 per million input tokens and $0.28 per million output tokens, the model sits at the budget end of the reasoning-capable tier. Its blended benchmark score of 35.1 spans only three benchmarks, so the performance picture is limited; the available scores cover coding, general knowledge, and a qualitative reasoning measure, but broader capability gaps remain untested. Teams prioritizing low inference cost and willing to tolerate thin benchmark coverage should consider it, while those needing thoroughly validated performance across diverse tasks may want a model with wider benchmark documentation before committing.

Quality Score
91/100
price + capability + benchmarks
Input Price
$0.08
per 1M tokens
Output Price
$0.28
per 1M tokens
Context Window
131,072
tokens
Model ID
qwen/qwen3-32b
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models