qwen

Qwen: Qwen3 32B

Qwen3 32B is a text-only model from Qwen with a 131,072-token context window and a 16,384-token output ceiling. It supports tool use and reasoning, which makes it usable for agentic workflows and multi-step tasks. Structured output support is unconfirmed from available data, so verify that before building pipelines that depend on it. At $0.08 per million input tokens and $0.28 per million output tokens, the model sits at the budget end of the reasoning-capable tier. Its blended benchmark score of 35.1 spans only three benchmarks, so the performance picture is limited; the available scores cover coding, general knowledge, and a qualitative reasoning measure, but broader capability gaps remain untested. Teams prioritizing low inference cost and willing to tolerate thin benchmark coverage should consider it, while those needing thoroughly validated performance across diverse tasks may want a model with wider benchmark documentation before committing.

Query via API → View on qwen → Estimate cost

Quality Score

91/100

price + capability + benchmarks

Input Price

$0.08

per 1M tokens

Output Price

$0.28

per 1M tokens

Context Window

131,072

tokens

Model ID: qwen/qwen3-32b
Vendor: qwen
Tokenizer: Qwen3
Input Modalities: text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

Similar models

qwen

Qwen: Qwen3 32B

Similar models

Qwen: Qwen3 8B

Qwen: Qwen3 14B

Qwen: Qwen3 30B A3B Thinking 2507

Qwen: Qwen3 30B A3B

Qwen: Qwen3 235B A22B

Qwen: Qwen3 Coder 30B A3B Instruct