meta-llama

Meta: Llama 3.2 3B Instruct

Meta: Llama 3.2 3B Instruct is a text-only model from Meta with a 131,072-token context window and a maximum completion length of 80,000 tokens. It does not support tool use, reasoning modes, or structured output, so it is best suited for straightforward text generation and comprehension tasks rather than agentic or multi-step workflows. At $0.0509 per million input tokens and $0.335 per million output tokens, it sits at the lower end of the pricing spectrum, which makes it worth considering for high-volume, cost-sensitive applications where inference budget matters more than raw capability. However, its blended benchmark score of 5.4 across only one independent benchmark gives limited confidence for direct performance comparisons, so teams requiring validated accuracy across diverse tasks should treat its benchmark standing as largely unproven and test it against their specific workloads before committing.

Query via API → View on meta-llama → Estimate cost

Quality Score

71/100

price + capability + benchmarks

Input Price

$0.05

per 1M tokens

Output Price

$0.34

per 1M tokens

Context Window

131,072

tokens

Model ID: meta-llama/llama-3.2-3b-instruct
Vendor: meta-llama
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 80,000 tokens
Tool Calling: not supported
Structured Output: not supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

Similar models

meta-llama

Meta: Llama 3.2 3B Instruct

Similar models

Meta: Llama 3.2 1B Instruct

Meta: Llama 3.3 70B Instruct (free)

Meta: Llama 3.2 3B Instruct (free)

Meta: Llama 3.2 11B Vision Instruct

Meta: Llama 3 8B Instruct

Meta: Llama Guard 4 12B