Meta: Llama 3.2 3B Instruct

Meta: Llama 3.2 3B Instruct is a text-only model from Meta with a 131,072-token context window and a maximum completion length of 80,000 tokens. It does not support tool calling, reasoning modes, or structured output, so it suits straightforward text generation and comprehension tasks better than agentic or multi-step workflows. At $0.0509 per million input tokens and $0.335 per million output tokens, it sits at the lower end of the pricing spectrum and is worth considering for high-volume, cost-sensitive applications where inference budget matters more than raw capability.

Its blended benchmark score of 5.4 comes from a single independent benchmark, which is too little evidence for direct performance comparisons. Teams that need validated accuracy across diverse tasks should treat its benchmark standing as largely unproven and test it against their own workloads before committing.
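To make the per-token prices concrete, the following is a minimal sketch of a cost estimate in Python. The two price constants are the unrounded figures quoted in the description. The function name `estimate_cost_usd` and the example workload (one million requests of 2,000 input and 300 output tokens each) are hypothetical and chosen only for illustration. Actual billing may differ by provider.

```python
# Prices quoted in the description, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.0509
OUTPUT_PRICE_PER_M = 0.335


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate spend in USD for a given number of input and output tokens."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 1,000,000 requests, each with 2,000 input
# tokens and 300 output tokens (illustrative numbers, not from the listing).
requests = 1_000_000
monthly = estimate_cost_usd(requests * 2_000, requests * 300)
print(f"Estimated cost: ${monthly:,.2f}")  # about $202.30
```

In this hypothetical workload the output tokens account for roughly half of the bill despite being far fewer, because the output price is about 6.6 times the input price.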

Quality Score: 71/100 (price + capability + benchmarks)
Input Price: $0.05 per 1M tokens
Output Price: $0.34 per 1M tokens
Context Window: 131,072 tokens
Model ID: meta-llama/llama-3.2-3b-instruct
Vendor: meta-llama
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 80,000 tokens
Tool Calling: not supported
Structured Output: not supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no
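
The context window and max output limits interact when sizing a request. The sketch below assumes, as is common but not stated in this listing, that prompt and completion tokens share the 131,072-token window, so a long prompt can leave less than the 80,000-token output cap. The function name `max_completion_tokens` is hypothetical.

```python
# Limits taken from the listing above.
CONTEXT_WINDOW = 131_072
MAX_OUTPUT = 80_000


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest completion budget for a prompt of the given size.

    Assumption (not confirmed by the listing): prompt and completion
    tokens both count against the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the output cap, and never return a negative budget.
    return max(0, min(MAX_OUTPUT, remaining))


print(max_completion_tokens(1_000))    # 80000: the output cap is the binding limit
print(max_completion_tokens(100_000))  # 31072: the context window is the binding limit
```

Under this assumption, the full 80,000-token completion is only available when the prompt is 51,072 tokens or shorter.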
