qwen

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

Query via API → Estimate cost

Quality Score

99/100

composite of price, context, capability

Input Price

$0.20

per 1M tokens

Output Price

$0.88

per 1M tokens

Context Window

262,144

tokens

Model ID: qwen/qwen3-vl-235b-a22b-instruct
Vendor: qwen
Tokenizer: Qwen3
Input Modalities: text, image
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: no

Similar models

qwen

Qwen: Qwen3 VL 235B A22B Instruct

Similar models

Qwen: Qwen Plus 0728 (thinking)

Qwen: Qwen3.6 35B A3B

Qwen: Qwen3.5 Plus 2026-04-20

Qwen: Qwen3.6 Flash

Qwen: Qwen3.6 27B

Qwen: Qwen3.6 Plus