meta-llama

Meta: Llama 3.1 8B Instruct

Meta: Llama 3.1 8B Instruct is a text-in, text-out model from Meta with a 131,072-token context window and a maximum completion length of 16,384 tokens. It supports tool use, which makes it usable in agentic and function-calling workflows, though it does not include a reasoning mode and structured output support is unconfirmed. Input is text only, so it is not suited for image or multimodal tasks. At $0.02 per million input tokens and $0.03 per million output tokens, this is one of the lower-cost options available, making it worth considering for high-volume or cost-sensitive workloads. Its blended benchmark score of 8.6 is drawn from only one independent benchmark, so performance comparisons should be treated as preliminary rather than definitive. Teams that need a tool-capable model on a tight budget and can tolerate limited third-party validation are the most practical audience here.

Query via API → View on meta-llama → Estimate cost

Quality Score

86/100

price + capability + benchmarks

Input Price

$0.02

per 1M tokens

Output Price

$0.03

per 1M tokens

Context Window

131,072

tokens

Model ID: meta-llama/llama-3.1-8b-instruct
Vendor: meta-llama
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

Similar models

meta-llama

Meta: Llama 3.1 8B Instruct

Similar models

Meta: Llama 3.3 70B Instruct

Meta: Llama 3.1 70B Instruct

Meta: Llama Guard 4 12B

Meta: Llama 3.2 11B Vision Instruct

Meta: Llama 3.3 70B Instruct (free)

Meta: Llama 4 Maverick