Meta: Llama 3.1 405B (base)

Meta: Llama 3.1 405B (base)

meta-llama · Released Aug 2, 2024 Efficient
Intelligence #215 / 544
39.1 Our Score
Speed #243 / 254
31.6 tokens / sec
Input #509 / 546
$4.00 per 1M tokens
Output #426 / 546
$4.00 per 1M tokens
Context #358 / 546
32,768 tokens

Analysis Summary

Meta: Llama 3.1 405B (base) sits in the Efficient tier on our leaderboard, ranked #215 of 544 published models on overall intelligence. At $4.00 input and $4.00 output per 1M tokens, it is among the most expensive on the market. It offers a mid-sized context window.

Editorial notes

Llama 3.1 405B base from Meta shows reasonable benchmark scores for a base model but lacks instruction tuning, tool use, and has a limited 32K context window at premium pricing.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.5Technical2.2Value6Content4.5
Intelligence 3.5/10
Technical 2.2/10
Content 4.5/10
Value 6/10

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This is the base 405B pre-trained version. It has demonstrated strong performance compared to leading closed-source models in human evaluations. Usage of this model is subject to Meta's Acceptable Use Policy.

405B Parameters

Architecture Detail

Instruct Typenone

Performance Indices

Source: Artificial Analysis

17.4 Intelligence Index
14.5 Coding Index
12.9 Agentic Index
3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 51.5% Graduate-level scientific reasoning
HLE 4.2% Humanity's Last Exam
MMLU Pro 73.2% Multi-task language understanding
MATH 500 70.3% Mathematical problem-solving
AIME 21.3% Competition mathematics
AIME 2025 3% Competition mathematics (2025)
SciCode 29.9% Scientific computing

Technical

LiveCodeBench 30.5% Live coding evaluation
TerminalBench Hard 6.8% Agentic terminal tasks
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 39% Instruction following
LCR 24.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 3.1 405B (base) stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-3.1-405b
Providermeta-llama
Model FamilyLlama 3
Release Date August 2, 2024
Context Length32,768 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $4.00 $0.004000
Output $4.00 $0.004000