Meta: Llama 3.1 70B Instruct

Meta: Llama 3.1 70B Instruct

meta-llama · Released Jul 23, 2024 Efficient
Intelligence #216 / 544
39.0 Our Score
Speed #243 / 252
31.0 tokens / sec
Input #336 / 544
$0.400 per 1M tokens
Output #211 / 544
$0.400 per 1M tokens
Context #202 / 544
131,072 tokens

Analysis Summary

Meta: Llama 3.1 70B Instruct sits in the Efficient tier on our leaderboard, ranked #216 of 544 published models on overall intelligence. At $0.400 input and $0.400 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use and function calling.

Editorial notes

Llama 3.1 70B Instruct from Meta delivers moderate general capability with tool use and a 131K context window at low cost, though benchmark scores indicate limited reasoning depth.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.8Technical1.9Value7.8Content3.5
Intelligence 2.8/10
Technical 1.9/10
Content 3.5/10
Value 7.8/10

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong..

70B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typellama3

Performance Indices

Source: Artificial Analysis

12.5 Intelligence Index
10.9 Coding Index
9.1 Agentic Index
4 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 40.9% Graduate-level scientific reasoning
HLE 4.6% Humanity's Last Exam
MMLU Pro 67.6% Multi-task language understanding
MATH 500 64.9% Mathematical problem-solving
AIME 17.3% Competition mathematics
AIME 2025 4% Competition mathematics (2025)
SciCode 26.7% Scientific computing

Technical

LiveCodeBench 23.2% Live coding evaluation
TerminalBench Hard 3% Agentic terminal tasks
τ²-Bench 15.2% Conversational agent benchmark

Content

IFBench 34.4% Instruction following
LCR 6.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 3.1 70B Instruct stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-3.1-70b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date July 23, 2024
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.40 $0.000400
Output $0.40 $0.000400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.8%
Avg Uptime
200ms
Best Latency (TTFT)
27 tok/s
Best Throughput
4/4
Active Endpoints
Available via: DeepInfra, Amazon Bedrock, WandB

Leaderboard Categories