Qwen: Qwen3 8B
Qwen3 8B is a text-only model from Qwen with a 131,072-token context window and a maximum of 8,192 output tokens per completion. It supports tool use and reasoning, which makes it capable of multi-step tasks and agentic workflows. Structured output support is unconfirmed based on available data. At $0.05 per million input tokens and $0.40 per million output tokens, Qwen3 8B sits at the affordable end of the market, making cost a genuine argument in its favor. However, its benchmark standing is difficult to evaluate with confidence: a blended score of 20.3 across only two benchmarks offers limited signal, and buyers who need reliable performance comparisons should treat that figure as preliminary rather than settled. It is best suited for developers running high-volume, text-based workloads where keeping costs low matters more than proven benchmark standing.
- Model ID
- qwen/qwen3-8b
- Vendor
- qwen
- Tokenizer
- Qwen3
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 8,192 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- text only
- Audio
- no
- Moderated
- no