qwen

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

Quality Score
99/100
composite of price, context, capability
Input Price
$0.20
per 1M tokens
Output Price
$0.88
per 1M tokens
Context Window
262,144
tokens
Model ID
qwen/qwen3-vl-235b-a22b-instruct
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text, image
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
✓ accepts images
Audio
no
Moderated
no

Similar models