DeepSeek: R1

DeepSeek: R1

deepseek · Released Jan 20, 2025 Specialist
Intelligence #112 / 523
54.2 Our Score
AA Index #107 / 351
27.1 Artificial Analysis
Input #373 / 523
$0.700 per 1M tokens
Output #376 / 523
$2.50 per 1M tokens
Context #326 / 523
64,000 tokens

Analysis Summary

DeepSeek: R1 sits in the Specialist tier on our leaderboard, ranked #112 of 523 published models on overall intelligence. At $0.700 input and $2.50 output per 1M tokens, it is among the most expensive on the market. It offers a mid-sized context window and supports tool use, function calling, and reasoning.

Editorial notes

DeepSeek R1 is a highly capable reasoning model with outstanding maths and coding benchmark scores, tool use support, and competitive pricing — offering exceptional value for technically demanding tasks. Businesses should be aware that provider accessibility and data residency considerations may apply depending on their compliance requirements.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.4Technical4.5Value6.5Content5.5
Intelligence 5.4/10
Technical 4.5/10
Content 5.5/10
Value 6.5/10

DeepSeek R1 is here: Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass..

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typedeepseek-r1

Performance Indices

Source: Artificial Analysis

27.1 Intelligence Index
24 Coding Index
26.2 Agentic Index
76 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 81.3% Graduate-level scientific reasoning
HLE 14.9% Humanity's Last Exam
MMLU Pro 84.9% Multi-task language understanding
MATH 500 98.3% Mathematical problem-solving
AIME 89.3% Competition mathematics
AIME 2025 76% Competition mathematics (2025)
SciCode 40.3% Scientific computing

Technical

LiveCodeBench 77% Live coding evaluation
TerminalBench Hard 15.9% Agentic terminal tasks
τ²-Bench 36.5% Conversational agent benchmark

Content

IFBench 39.6% Instruction following
LCR 54.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does DeepSeek: R1 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID deepseek/deepseek-r1
Providerdeepseek
Model FamilyDeepSeek
Release Date January 20, 2025
Context Length64,000 tokens
Max Completion16,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.70 $0.000700
Output $2.50 $0.002500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

96.9%
Avg Uptime
1,613ms
Best Latency (TTFT)
61 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Novita, Azure

Leaderboard Categories