Meta: Llama 4 Maverick

Meta: Llama 4 Maverick

meta-llama · Released Apr 5, 2025 Professional
Intelligence #10 / 565
82.0 Our Score
Speed #109 / 262
119.7 tokens / sec
Input #242 / 565
$0.150 per 1M tokens
Output #264 / 565
$0.600 per 1M tokens
Context #17 / 565
1M tokens

Analysis Summary

Meta: Llama 4 Maverick sits in the Professional tier on our leaderboard, ranked #10 of 565 published models on overall intelligence. At $0.150 input and $0.600 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, and vision.

Editorial notes

Meta Llama 4 Maverick pairs a 1M token context with vision, tool use, strong instruction-following, and competitive pricing, though its intelligence and agentic scores are moderate for the price tier.

Assessed May 31, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.4Technical2.4Value8.3Content5.1
Intelligence 3.4/10
Technical 2.4/10
Content 5.1/10
Value 8.3/10

How Meta: Llama 4 Maverick compares

Meta: Llama 4 Maverick ranks #184 of 370 AI models we track for overall intelligence, #169 of 307 for coding, #235 of 282 for agentic tasks. Its 1M-token context window is larger than 97% of the models we list. At $0.15 per million input tokens it is cheaper than 57% of comparable models.

About Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

18.4 Intelligence Index
15.6 Coding Index
12.3 Agentic Index
19.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 67.1% Graduate-level scientific reasoning
HLE 4.8% Humanity's Last Exam
MMLU Pro 80.9% Multi-task language understanding
MATH 500 88.9% Mathematical problem-solving
AIME 39% Competition mathematics
AIME 2025 19.3% Competition mathematics (2025)
SciCode 33.1% Scientific computing

Technical

LiveCodeBench 39.7% Live coding evaluation
TerminalBench Hard 6.8% Agentic terminal tasks
τ²-Bench 17.8% Conversational agent benchmark

Content

IFBench 43% Instruction following
LCR 46% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 4 Maverick stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-4-maverick
Providermeta-llama
Model FamilyLlama 4
Release Date April 5, 2025
Context Length1,048,576 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.15 $0.000150
Output $0.60 $0.000600

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

98.8%
Avg Uptime
304ms
Best Latency (TTFT)
65 tok/s
Best Throughput
5/5
Active Endpoints
Available via: DeepInfra, Novita, Parasail, Google, SambaNova

Leaderboard Categories

Frequently asked questions about Meta: Llama 4 Maverick

How much does Meta: Llama 4 Maverick cost?

Meta: Llama 4 Maverick costs $0.15 per million input tokens and $0.60 per million output tokens.

What is the context window of Meta: Llama 4 Maverick?

Meta: Llama 4 Maverick has a context window of 1,048,576 tokens (1M).

Is Meta: Llama 4 Maverick good for coding?

On our coding benchmark index, Meta: Llama 4 Maverick ranks #169 of 307 models, placing it in the broader range of the field for code generation and debugging.

What can Meta: Llama 4 Maverick do?

Meta: Llama 4 Maverick supports image/vision input, tool use, and function calling.

Who created Meta: Llama 4 Maverick?

Meta: Llama 4 Maverick is developed by Meta and was released on April 5, 2025.