Meta: Llama 4 Maverick

Meta: Llama 4 Maverick

meta-llama · Released Apr 5, 2025 Efficient
Intelligence #155 / 551
48.4 Our Score
Speed #105 / 257
113.2 tokens / sec
Input #237 / 552
$0.150 per 1M tokens
Output #260 / 552
$0.600 per 1M tokens
Context #15 / 552
1M tokens

Analysis Summary

Meta: Llama 4 Maverick sits in the Efficient tier on our leaderboard, ranked #155 of 551 published models on overall intelligence. At $0.150 input and $0.600 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports vision.

Editorial notes

Llama 4 Maverick from Meta brings vision support and a 1M token context at very low pricing, but its intelligence and agentic indices are weak, limiting its suitability for complex business tasks.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.9Technical2.4Value8.3Content5
Intelligence 3.9/10
Technical 2.4/10
Content 5/10
Value 8.3/10

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward..

Capabilities

Vision

Performance Indices

Source: Artificial Analysis

18.4 Intelligence Index
15.6 Coding Index
12.3 Agentic Index
19.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 67.1% Graduate-level scientific reasoning
HLE 4.8% Humanity's Last Exam
MMLU Pro 80.9% Multi-task language understanding
MATH 500 88.9% Mathematical problem-solving
AIME 39% Competition mathematics
AIME 2025 19.3% Competition mathematics (2025)
SciCode 33.1% Scientific computing

Technical

LiveCodeBench 39.7% Live coding evaluation
TerminalBench Hard 6.8% Agentic terminal tasks
τ²-Bench 17.8% Conversational agent benchmark

Content

IFBench 43% Instruction following
LCR 46% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 4 Maverick stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-4-maverick
Providermeta-llama
Model FamilyLlama 4
Release Date April 5, 2025
Context Length1,048,576 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.15 $0.000150
Output $0.60 $0.000600

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.6%
Avg Uptime
260ms
Best Latency (TTFT)
60 tok/s
Best Throughput
4/4
Active Endpoints
Available via: DeepInfra, Novita, Parasail, SambaNova

Leaderboard Categories