Home > AI Models > DeepSeek: R1 Distill Llama 70B

DeepSeek: R1 Distill Llama 70B

Name: DeepSeek: R1 Distill Llama 70B
Price: 0.7000 USD
Rating: 38 (1 reviews)
Author: deepseek

DeepSeek: R1 Distill Llama 70B

deepseek · Released Jan 23, 2025 Efficient

Our Score

Performance Profile

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: - AIME 2024 pass@1: 70.0
- MATH-500 pass@1: 94.5
- CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

$0.70 / 1M

Input Price

$0.80 / 1M

Output Price

131,072 tokens

Context Window

16,384 tokens

Max Output

70B Parameters

Architecture

Modality	Text → Text
Tokenizer	`Llama3`
Instruct Type	`deepseek-r1`
Parameters	70B

Performance Indices

Source: Artificial Analysis

16 Intelligence Index

11.4 Coding Index

11.7 Agentic Index

53.7 Math Index

Benchmark Scores

GPQA Diamond 40.2% Graduate-level scientific reasoning

HLE 6.1% Humanity's Last Exam

MMLU Pro 79.5% Multi-task language understanding

MATH 500 93.5% Mathematical problem-solving

AIME 67% Competition mathematics

AIME 2025 53.7% Competition mathematics (2025)

SciCode 31.2% Scientific computing

LiveCodeBench 26.6% Live coding evaluation

TerminalBench Hard 1.5% Agentic terminal tasks

τ²-Bench 21.9% Conversational agent benchmark

IFBench 27.6% Instruction following

LCR 11% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID	`deepseek/deepseek-r1-distill-llama-70b`
Provider	deepseek
Model Family	DeepSeek
Release Date	January 23, 2025
Context Length	131,072 tokens
Max Completion	16,384 tokens
Status	Active