meta-llama

Meta: Llama 3.1 8B Instruct

Meta: Llama 3.1 8B Instruct is a text-in, text-out model from Meta with a 131,072-token context window and a maximum completion length of 16,384 tokens. It supports tool use, which makes it usable in agentic and function-calling workflows, though it does not include a reasoning mode and structured output support is unconfirmed. Input is text only, so it is not suited for image or multimodal tasks. At $0.02 per million input tokens and $0.03 per million output tokens, this is one of the lower-cost options available, making it worth considering for high-volume or cost-sensitive workloads. Its blended benchmark score of 8.6 is drawn from only one independent benchmark, so performance comparisons should be treated as preliminary rather than definitive. Teams that need a tool-capable model on a tight budget and can tolerate limited third-party validation are the most practical audience here.

Quality Score
86/100
price + capability + benchmarks
Input Price
$0.02
per 1M tokens
Output Price
$0.03
per 1M tokens
Context Window
131,072
tokens
Model ID
meta-llama/llama-3.1-8b-instruct
Vendor
meta-llama
Tokenizer
Llama3
Input Modalities
text
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
no
Moderated
no

Similar models