Home > AI Models > Anthropic: Claude Sonnet 4

Anthropic: Claude Sonnet 4

Name: Anthropic: Claude Sonnet 4 Review
Item: Anthropic: Claude Sonnet 4
Rating: 7.2
Author: Design for Online

Anthropic: Claude Sonnet 4

anthropic · Released May 22, 2025 Professional

Intelligence #55 / 556

72.2 Our Score

Speed #218 / 257

44.7 tokens / sec

Input #502 / 557

$3.00 per 1M tokens

Output #510 / 557

$15.00 per 1M tokens

Context #49 / 557

1M tokens

Anthropic: Claude Sonnet 4 sits in the Professional tier on our leaderboard, ranked #55 of 556 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Claude Sonnet 4 from Anthropic combines a 1M token context, vision, strong instruction following, and reliable agentic performance with an intelligence index of 33; output pricing at $15 per million tokens is the main cost consideration.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 1M tokens
Max output: 64,000 tokens
Tokenizer: Claude
Released: May 22, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

33 Intelligence Index

30.6 Coding Index

39.8 Agentic Index

38 Math Index

Benchmark Scores

GPQA Diamond 68.3% Graduate-level scientific reasoning

HLE 4% Humanity's Last Exam

MMLU Pro 83.7% Multi-task language understanding

MATH 500 93.4% Mathematical problem-solving

AIME 40.7% Competition mathematics

AIME 2025 38% Competition mathematics (2025)

SciCode 37.3% Scientific computing

LiveCodeBench 44.9% Live coding evaluation

TerminalBench Hard 27.3% Agentic terminal tasks

τ²-Bench 52.3% Conversational agent benchmark

IFBench 45.4% Instruction following

LCR 44.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Anthropic: Claude Sonnet 4 stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID	`anthropic/claude-sonnet-4`
Provider	anthropic
Release Date	May 22, 2025
Context Length	1,000,000 tokens
Max Completion	64,000 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$3.00	$0.003000
Output	$15.00	$0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

83.8%

Avg Uptime

794ms

Best Latency (TTFT)

45 tok/s

Best Throughput

4/6

Active Endpoints

Available via: Google, Amazon Bedrock, Anthropic

Leaderboard Categories

AI Agents Content Writing General Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Anthropic: Claude Sonnet 4

Anthropic: Claude Sonnet 4

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Anthropic: Claude Sonnet 4

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models