qwen

Qwen: Qwen3 8B

Qwen3 8B is a text-only model from Qwen with a 131,072-token context window and a maximum of 8,192 output tokens per completion. It supports tool use and reasoning, which makes it capable of multi-step tasks and agentic workflows. Structured output support is unconfirmed based on available data. At $0.05 per million input tokens and $0.40 per million output tokens, Qwen3 8B sits at the affordable end of the market, making cost a genuine argument in its favor. However, its benchmark standing is difficult to evaluate with confidence: a blended score of 20.3 across only two benchmarks offers limited signal, and buyers who need reliable performance comparisons should treat that figure as preliminary rather than settled. It is best suited for developers running high-volume, text-based workloads where keeping costs low matters more than proven benchmark standing.

Quality Score
91/100
price + capability + benchmarks
Input Price
$0.05
per 1M tokens
Output Price
$0.40
per 1M tokens
Context Window
131,072
tokens
Model ID
qwen/qwen3-8b
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text
Output Modalities
text
Max Output
8,192 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models