deepseek

DeepSeek: R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a text-only model from DeepSeek with a 128,000-token context window and a maximum of 8,192 completion tokens per response. It supports reasoning workflows, which makes it suited for multi-step problem solving, but it does not support tool use and has no confirmed structured output capability. Input and output are both priced at $0.80 per million tokens. At that price point it sits in the budget tier, and the case for shortlisting it rests largely on cost rather than verified performance breadth; benchmark coverage spans only one independent benchmark, yielding a blended score of 15.1, so head-to-head comparisons with better-covered models are limited. Teams with straightforward text and reasoning workloads who want to keep costs low may find it worth testing, but buyers who need tool calling, structured output, or confidence from broad benchmark coverage should look elsewhere before committing.

Query via API → View on deepseek → Estimate cost

Quality Score

75/100

price + capability + benchmarks

Input Price

$0.80

per 1M tokens

Output Price

$0.80

per 1M tokens

Context Window

128,000

tokens

Model ID: deepseek/deepseek-r1-distill-llama-70b
Vendor: deepseek
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 8,192 tokens
Tool Calling: not supported
Structured Output: not supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

Similar models

deepseek

DeepSeek: R1 Distill Llama 70B

Similar models

DeepSeek: DeepSeek V3

DeepSeek: DeepSeek V3 0324

DeepSeek: DeepSeek V3.2

DeepSeek: R1

DeepSeek: R1 0528

DeepSeek: DeepSeek V3.1 Terminus