Anthropic: Claude Sonnet 4

Anthropic: Claude Sonnet 4

anthropic · Released May 22, 2025 Professional
Intelligence #55 / 556
72.2 Our Score
Speed #218 / 257
44.7 tokens / sec
Input #502 / 557
$3.00 per 1M tokens
Output #510 / 557
$15.00 per 1M tokens
Context #49 / 557
1M tokens

Analysis Summary

Anthropic: Claude Sonnet 4 sits in the Professional tier on our leaderboard, ranked #55 of 556 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Claude Sonnet 4 from Anthropic combines a 1M token context, vision, strong instruction following, and reliable agentic performance with an intelligence index of 33; output pricing at $15 per million tokens is the main cost consideration.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.8Technical5.5Value6.3Content7.5
Intelligence 5.8/10
Technical 5.5/10
Content 7.5/10
Value 6.3/10

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

33 Intelligence Index
30.6 Coding Index
39.8 Agentic Index
38 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 68.3% Graduate-level scientific reasoning
HLE 4% Humanity's Last Exam
MMLU Pro 83.7% Multi-task language understanding
MATH 500 93.4% Mathematical problem-solving
AIME 40.7% Competition mathematics
AIME 2025 38% Competition mathematics (2025)
SciCode 37.3% Scientific computing

Technical

LiveCodeBench 44.9% Live coding evaluation
TerminalBench Hard 27.3% Agentic terminal tasks
τ²-Bench 52.3% Conversational agent benchmark

Content

IFBench 45.4% Instruction following
LCR 44.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Anthropic: Claude Sonnet 4 stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID anthropic/claude-sonnet-4
Provideranthropic
Release Date May 22, 2025
Context Length1,000,000 tokens
Max Completion64,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $3.00 $0.003000
Output $15.00 $0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

83.8%
Avg Uptime
794ms
Best Latency (TTFT)
45 tok/s
Best Throughput
4/6
Active Endpoints
Available via: Google, Amazon Bedrock, Anthropic

Leaderboard Categories