Anthropic: Claude Opus 4


anthropic · Released May 22, 2025
Our Score: 68

Claude Opus 4 was benchmarked as the world's best coding model at the time of its release, delivering sustained performance on complex, long-running tasks and agent workflows. It set new standards in software engineering, achieving leading results on SWE-bench (72.5%) and Terminal-bench (43.2%). Opus 4 supports extended agentic workflows, handling thousands of task steps continuously for hours without degradation.

Input Price: $15.00 / 1M tokens
Output Price: $75.00 / 1M tokens
Context Window: 200,000 tokens
Max Output: 32,000 tokens

Capabilities

Tool Use · Function Calling · Vision

Architecture

Modality: Text + Image + File → Text
Tokenizer: Claude

Performance Indices

Source: Artificial Analysis

Intelligence Index: 27.4
Coding Index: 34
Agentic Index: 52.3
Math Index: 73.3

Benchmark Scores

Evaluations

GPQA Diamond 79.6%
Graduate-level scientific reasoning
HLE 11.7%
Humanity's Last Exam
MMLU Pro 87.3%
Multi-task language understanding
LiveCodeBench 63.6%
Live coding evaluation
SciCode 39.8%
Scientific computing
MATH 500 98.2%
Mathematical problem-solving
AIME 75.7%
Competition mathematics
AIME 2025 73.3%
Competition mathematics (2025)
IFBench 53.7%
Instruction following
LCR 33.7%
Long-context reasoning
TerminalBench Hard 31.1%
Agentic terminal tasks
τ²-Bench 73.4%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID: anthropic/claude-opus-4
Provider: anthropic
Release Date: May 22, 2025
Context Length: 200,000 tokens
Max Completion: 32,000 tokens
Status: Active
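
The OpenRouter ID above can be used directly against OpenRouter's OpenAI-compatible chat completions endpoint. A minimal stdlib-only sketch (the helper names and the guarded `complete` wrapper are illustrative, not an official client; set `OPENROUTER_API_KEY` in your environment before sending a real request):

```python
import json
import os
import urllib.request

# Endpoint and model ID per the listing above; OpenRouter exposes an
# OpenAI-compatible chat completions API at this URL.
API_URL = "https://openrouter.ai/api/v1/chat/completions"
MODEL_ID = "anthropic/claude-opus-4"


def build_request(prompt, max_tokens=1024):
    """Assemble URL, headers, and JSON payload for a single-turn request."""
    payload = {
        "model": MODEL_ID,
        "max_tokens": max_tokens,  # this model caps completions at 32,000 tokens
        "messages": [{"role": "user", "content": prompt}],
    }
    headers = {
        "Authorization": f"Bearer {os.environ.get('OPENROUTER_API_KEY', '')}",
        "Content-Type": "application/json",
    }
    return API_URL, headers, payload


def complete(prompt):
    """POST the request and return the assistant's reply text."""
    url, headers, payload = build_request(prompt)
    req = urllib.request.Request(
        url, data=json.dumps(payload).encode(), headers=headers, method="POST"
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

In practice most users would reach the same endpoint through an OpenAI-compatible SDK by overriding the base URL; the payload shape is identical.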

Pricing

Token Type | Cost per 1M tokens | Cost per 1K tokens
Input      | $15.00             | $0.015
Output     | $75.00             | $0.075
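
As a quick sanity check on the table, per-request cost is just a weighted sum of the two token counts. A minimal helper (the token counts in the example are illustrative):

```python
# Rates from the pricing table above, in USD per 1M tokens.
INPUT_PER_M = 15.00
OUTPUT_PER_M = 75.00


def request_cost(input_tokens, output_tokens):
    """Estimate the USD cost of one request from its token counts."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


# e.g. a 2,000-token prompt with an 800-token reply:
# 2,000 * 15 / 1e6 + 800 * 75 / 1e6 = 0.03 + 0.06 = 0.09 USD
cost = request_cost(2_000, 800)
```

Note that output tokens cost 5x input tokens, so long completions dominate the bill even for prompt-heavy workloads.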

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 100%
Best Latency (TTFT): 1,362 ms
Best Throughput: 4 tok/s
Active Endpoints: 1/4
Available via: Amazon Bedrock, Google, Anthropic
