Meta: Llama 4 Scout

meta-llama · Released Apr 5, 2025 Efficient
Intelligence #190 / 551
42.8 Our Score
Speed #74 / 257
136.4 tokens / sec
Input #174 / 552
$0.080 per 1M tokens
Output #199 / 552
$0.300 per 1M tokens
Context #88 / 552
327,680 tokens

Analysis Summary

Meta: Llama 4 Scout sits in the Efficient tier on our leaderboard, ranked #190 of 551 published models on overall intelligence. At $0.080 input and $0.300 output per 1M tokens, its pricing is low, which is reflected in its 8/10 Value score. Its 327,680-token context window suits extended reasoning and code review, and it supports tool use, function calling, and vision.

Editorial notes

Meta Llama 4 Scout delivers vision, tool use, and a 327,680-token context at very low pricing, with reasonable GPQA Diamond and MMLU Pro scores, though its coding and agentic benchmark scores are weak.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 3.1/10
Technical 1.5/10
Content 4.5/10
Value 8/10

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17B of its 109B total parameters. It supports native multimodal input...

Capabilities

Tool Use, Function Calling, Vision

Performance Indices

Source: Artificial Analysis

13.5 Intelligence Index
6.7 Coding Index
8.5 Agentic Index
14 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 58.7% Graduate-level scientific reasoning
HLE 4.3% Humanity's Last Exam
MMLU Pro 75.2% Multi-task language understanding
MATH 500 84.4% Mathematical problem-solving
AIME 28.3% Competition mathematics
AIME 2025 14% Competition mathematics (2025)
SciCode 17% Scientific computing

Technical

LiveCodeBench 29.9% Live coding evaluation
TerminalBench Hard 1.5% Agentic terminal tasks
τ²-Bench 15.5% Conversational agent benchmark

Content

IFBench 39.5% Instruction following
LCR 25.8% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID meta-llama/llama-4-scout
Provider meta-llama
Model Family Llama 4
Release Date April 5, 2025
Context Length 327,680 tokens
Max Completion 16,384 tokens
Status Active
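
The context length and max completion limits listed above interact when sizing a request. The following is a minimal sketch, assuming that prompt and completion tokens share the same 327,680-token window (a common convention, but not something this page states). The function name `completion_budget` is hypothetical and used only for illustration.

```python
# Limits copied from the Model Information listing above.
CONTEXT_LENGTH = 327_680  # total context window, in tokens
MAX_COMPLETION = 16_384   # maximum completion size, in tokens


def completion_budget(prompt_tokens: int) -> int:
    """Return how many completion tokens are available for a given prompt size.

    Assumption: prompt and completion share one context window.
    Returns 0 when the prompt alone fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    # The completion is capped both by the remaining window and by the
    # per-request completion limit.
    return max(0, min(MAX_COMPLETION, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 320_000, 400_000):
        print(prompt, completion_budget(prompt))
```

Under this assumption, the full 16,384-token completion is only available when the prompt stays at or below 311,296 tokens.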

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.30 $0.000300
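
To make the per-token rates concrete, here is a small sketch that estimates the cost of a single request from the table above. The function name `estimate_cost_usd` and the example token counts are illustrative assumptions; actual billing may differ by provider.

```python
# Rates copied from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_M = 0.08
OUTPUT_USD_PER_M = 0.30


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 50,000-token prompt with a 2,000-token reply.
    cost = estimate_cost_usd(50_000, 2_000)
    print(f"${cost:.4f}")  # prints $0.0046
```

At these rates, output tokens cost 3.75 times as much as input tokens, so long completions dominate the bill faster than long prompts do.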

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime 100%
Best Latency (TTFT) 106ms
Best Throughput 363 tok/s
Active Endpoints 4/4
Available via: DeepInfra, Groq, Novita, Google
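
The latency and throughput figures above can be combined into a rough response-time estimate. This is a back-of-the-envelope sketch, assuming generation proceeds at a constant rate after the first token. Pairing the best-case TTFT with the 136.4 tokens/sec speed from the header stats is also an assumption, since those numbers may come from different endpoints. The function name `response_time_s` is hypothetical.

```python
# Figures copied from this page.
BEST_TTFT_S = 0.106          # best latency (time to first token): 106 ms
BEST_THROUGHPUT_TPS = 363.0  # best throughput, tokens per second
LISTED_SPEED_TPS = 136.4     # speed figure from the header stats


def response_time_s(output_tokens: int, tokens_per_s: float,
                    ttft_s: float = BEST_TTFT_S) -> float:
    """Estimate seconds to receive a full response.

    Assumption: constant generation rate after the first token.
    """
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    # Hypothetical 1,000-token reply at the best and at the listed speed.
    for label, tps in (("best", BEST_THROUGHPUT_TPS), ("listed", LISTED_SPEED_TPS)):
        print(f"{label}: {response_time_s(1_000, tps):.2f} s")
```

For a 1,000-token reply this gives roughly 2.9 seconds at the best throughput and roughly 7.4 seconds at the listed speed, so the choice of endpoint matters more than TTFT for long outputs.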

Leaderboard Categories