Llama 3.1 Tulu3 405B

Llama 3.1 Tulu3 405B

Allen Institute for AI · Released Jan 30, 2025 Emerging
Intelligence #483 / 556
14.6 Our Score
AA Index #239 / 365
14.1 Artificial Analysis
Input
Not priced
Output
Not priced
Context
Not reported

Analysis Summary

Llama 3.1 Tulu3 405B sits in the Emerging tier on our leaderboard, ranked #483 of 556 published models on overall intelligence. At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market.

Editorial notes

Llama 3.1 Tulu3 405B from Allen Institute shows reasonable MMLU-Pro and coding scores for a 2025 open model, but its intelligence index is low relative to current-generation alternatives.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.1Technical0Value0Content3.5
Intelligence 3.1/10
Technical 0/10
Content 3.5/10
Value 0/10

Performance Indices

Source: Artificial Analysis

14.1 Intelligence Index

Benchmark Scores

Intelligence

GPQA Diamond 51.6% Graduate-level scientific reasoning
HLE 3.5% Humanity's Last Exam
MMLU Pro 71.6% Multi-task language understanding
MATH 500 77.8% Mathematical problem-solving
AIME 13.3% Competition mathematics
SciCode 30.2% Scientific computing

Technical

LiveCodeBench 29.1% Live coding evaluation

Benchmark data from Artificial Analysis and Hugging Face

How does Llama 3.1 Tulu3 405B stack up?

Compare side-by-side with other emerging models.

Compare Models

Model Information

ProviderAllen Institute for AI
Release Date January 30, 2025
Status Active