Home > AI Models > Anthropic: Claude 3.7 Sonnet (thinking)

Anthropic: Claude 3.7 Sonnet (thinking)

Name: Anthropic: Claude 3.7 Sonnet (thinking) Review
Item: Anthropic: Claude 3.7 Sonnet (thinking)
Rating: 7
Author: Design for Online

Anthropic: Claude 3.7 Sonnet (thinking)

anthropic · Released Feb 24, 2025 Specialist

Intelligence #63 / 556

69.6 Our Score

AA Index #71 / 365

34.7 Artificial Analysis

Input #501 / 556

$3.00 per 1M tokens

Output #509 / 556

$15.00 per 1M tokens

Context #171 / 556

200,000 tokens

Anthropic: Claude 3.7 Sonnet (thinking) sits in the Specialist tier on our leaderboard, ranked #63 of 556 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Editorial notes

Claude 3.7 Sonnet (thinking) from Anthropic combines extended reasoning with strong agentic and coding capability, vision support, and a 200K context, well-suited to complex business and editorial tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 200,000 tokens
Max output: 64,000 tokens
Tokenizer: Claude
Released: Feb 24, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Claude 3.7 Sonnet is an advanced large language model with improved reasoning, coding, and problem-solving capabilities. It introduces a hybrid reasoning approach, allowing users to choose between rapid responses and..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

34.7 Intelligence Index

27.6 Coding Index

37.9 Agentic Index

56.3 Math Index

Benchmark Scores

GPQA Diamond 77.2% Graduate-level scientific reasoning

HLE 10.3% Humanity's Last Exam

MMLU Pro 83.7% Multi-task language understanding

MATH 500 94.7% Mathematical problem-solving

AIME 48.7% Competition mathematics

AIME 2025 56.3% Competition mathematics (2025)

SciCode 40.3% Scientific computing

LiveCodeBench 47.3% Live coding evaluation

TerminalBench Hard 21.2% Agentic terminal tasks

τ²-Bench 54.7% Conversational agent benchmark

IFBench 48.3% Instruction following

LCR 60.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Anthropic: Claude 3.7 Sonnet (thinking) stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID	`anthropic/claude-3.7-sonnet:thinking`
Provider	anthropic
Model Family	Claude 3
Release Date	February 24, 2025
Context Length	200,000 tokens
Max Completion	64,000 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$3.00	$0.003000
Output	$15.00	$0.015000

Leaderboard Categories

AI Agents Coding Content Writing Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Anthropic: Claude 3.7 Sonnet (thinking)

Anthropic: Claude 3.7 Sonnet (thinking)

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Leaderboard Categories

External Resources

Anthropic: Claude 3.7 Sonnet (thinking)

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Leaderboard Categories

External Resources

Explore Related Models