Anthropic: Claude Sonnet 4

Anthropic: Claude Sonnet 4

anthropic · Released May 22, 2025 Professional
Intelligence #32 / 525
77.5 Our Score
Speed #190 / 244
52.3 tokens / sec
Input #474 / 525
$3.00 per 1M tokens
Output #482 / 525
$15.00 per 1M tokens
Context #39 / 525
1M tokens

Analysis Summary

Anthropic: Claude Sonnet 4 sits in the Professional tier on our leaderboard, ranked #32 of 525 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Anthropic's Claude Sonnet 4 is a well-rounded business model with strong reasoning, solid coding performance, vision support, a 1M token context window, and Anthropic's trusted reliability — a dependable choice for content, analysis, and agentic workflows at a mid-range price.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6.7Technical6.4Value6.3Content7.5
Intelligence 6.7/10
Technical 6.4/10
Content 7.5/10
Value 6.3/10

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

33 Intelligence Index
30.6 Coding Index
39.8 Agentic Index
38 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 68.3% Graduate-level scientific reasoning
HLE 4% Humanity's Last Exam
MMLU Pro 83.7% Multi-task language understanding
MATH 500 93.4% Mathematical problem-solving
AIME 40.7% Competition mathematics
AIME 2025 38% Competition mathematics (2025)
SciCode 37.3% Scientific computing

Technical

LiveCodeBench 44.9% Live coding evaluation
TerminalBench Hard 27.3% Agentic terminal tasks
τ²-Bench 52.3% Conversational agent benchmark

Content

IFBench 45.4% Instruction following
LCR 44.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Anthropic: Claude Sonnet 4 stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID anthropic/claude-sonnet-4
Provideranthropic
Release Date May 22, 2025
Context Length1,000,000 tokens
Max Completion64,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $3.00 $0.003000
Output $15.00 $0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7%
Avg Uptime
876ms
Best Latency (TTFT)
53 tok/s
Best Throughput
3/5
Active Endpoints
Available via: Google, Anthropic, Amazon Bedrock

Leaderboard Categories