Anthropic: Claude Sonnet 4

Anthropic: Claude Sonnet 4

anthropic · Released May 22, 2025 Professional
Intelligence #53 / 544
72.2 Our Score
Speed #212 / 257
46.9 tokens / sec
Input #497 / 551
$3.00 per 1M tokens
Output #505 / 551
$15.00 per 1M tokens
Context #44 / 551
1M tokens

Analysis Summary

Anthropic: Claude Sonnet 4 sits in the Professional tier on our leaderboard, ranked #53 of 544 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Claude Sonnet 4 from Anthropic delivers strong instruction following, vision, tool use, and a 1M token context with reliable agentic performance, making it a capable choice for client-facing content and workflow automation.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.8Technical5.5Value6.3Content7.5
Intelligence 5.8/10
Technical 5.5/10
Content 7.5/10
Value 6.3/10

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

33 Intelligence Index
30.6 Coding Index
39.8 Agentic Index
38 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 68.3% Graduate-level scientific reasoning
HLE 4% Humanity's Last Exam
MMLU Pro 83.7% Multi-task language understanding
MATH 500 93.4% Mathematical problem-solving
AIME 40.7% Competition mathematics
AIME 2025 38% Competition mathematics (2025)
SciCode 37.3% Scientific computing

Technical

LiveCodeBench 44.9% Live coding evaluation
TerminalBench Hard 27.3% Agentic terminal tasks
τ²-Bench 52.3% Conversational agent benchmark

Content

IFBench 45.4% Instruction following
LCR 44.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Anthropic: Claude Sonnet 4 stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID anthropic/claude-sonnet-4
Provideranthropic
Release Date May 22, 2025
Context Length1,000,000 tokens
Max Completion64,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $3.00 $0.003000
Output $15.00 $0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
761ms
Best Latency (TTFT)
42 tok/s
Best Throughput
1/6
Active Endpoints
Available via: Google, Amazon Bedrock, Anthropic

Leaderboard Categories