mistralai

Mistral: Mistral Small 3.2 24B

Mistral Small 3.2 24B is a multimodal model from Mistral AI that accepts both image and text inputs, supports tool use, and offers a 128,000-token context window with up to 16,384 tokens of output per completion. It does not include a dedicated reasoning mode, and structured output support is unconfirmed. The combination of vision capability and tool use makes it broadly applicable to agentic and document-processing workflows without requiring a larger, more expensive model. At $0.075 per million input tokens and $0.20 per million output tokens, it sits in the budget-to-midrange tier, making it worth considering for teams running high-volume inference who need image understanding alongside text. Benchmark coverage is thin, with a blended score of 59.6 drawn from a single benchmark, so performance claims should be treated as provisional rather than well-established. Teams that need wider benchmark validation before committing should treat it as a candidate to test rather than a settled choice.

Quality Score
90/100
price + capability + benchmarks
Input Price
$0.07
per 1M tokens
Output Price
$0.20
per 1M tokens
Context Window
128,000
tokens
Model ID
mistralai/mistral-small-3.2-24b-instruct
Vendor
mistralai
Tokenizer
Mistral
Input Modalities
image, text
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
✓ accepts images
Audio
no
Moderated
no

Similar models