General

Compare top AI models across all use cases.

Updated July 2, 2026

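| # | Model | Provider | Badges | Score | AI Index | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|---|---|
| 1 | Anthropic: Claude Fable 5 | anthropic | New, Top Pick | 93.3 | 59.9 | 1M | $10.00 | $50.00 |
| 2 | Anthropic: Claude Sonnet 5 | anthropic | New, In-House Pick | 93 | 53.4 | 1M | $2.00 | $10.00 |
| 3 | Anthropic: Claude Opus 4.8 | anthropic | Top Pick, In-House Pick | 92.3 | 55.7 | 1M | $5.00 | $25.00 |
| 4 | Google: Gemini 3.1 Pro Preview | google | | 89.8 | 46.5 | 1M | $2.00 | $12.00 |
| 5 | Anthropic: Claude Opus 4.7 | anthropic | | 89.4 | 53.5 | 1M | $5.00 | $25.00 |
| 6 | Google: Gemini 3.5 Flash | google | | 88.9 | 50.2 | 1M | $1.50 | $9.00 |
| 7 | OpenAI: GPT-5.5 | openai | Top Pick | 88.3 | 54.8 | 1.1M | $5.00 | $30.00 |
| 8 | OpenAI: GPT-5.4 | openai | | 87 | 51.4 | 1.1M | $2.50 | $15.00 |
| 9 | Anthropic: Claude Sonnet 4.6 | anthropic | | 85.1 | 47.2 | 1M | $3.00 | $15.00 |
| 10 | Qwen: Qwen3.7 Max | qwen | | 83.5 | 46 | 1M | $1.25 | $3.75 |
| 11 | Anthropic: Claude Opus 4.6 | anthropic | | 83 | 43.7 | 1M | $5.00 | $25.00 |
| 12 | MiniMax: MiniMax M2.1 | minimax | | 82 | 31.4 | 205K | $0.3000 | $1.20 |
| 13 | OpenAI: GPT-4.1 | openai | | 82 | 19.4 | 1M | $2.00 | $8.00 |
| 14 | Solar Pro 2 (Non-reasoning) | Upstage | | 82 | 7.8 | | Free | Free |
| 15 | Anthropic: Claude Haiku 4.5 | anthropic | | 82 | 29.6 | 200K | $1.00 | $5.00 |
| 16 | NVIDIA: Llama 3.1 Nemotron 70B Instruct | nvidia | | 82 | 7.6 | 131K | $1.20 | $1.20 |
| 17 | Anthropic: Claude Opus 4.1 | anthropic | | 82 | 33.7 | 200K | $15.00 | $75.00 |
| 18 | Qwen: Qwen3.6 Plus | qwen | | 82 | 39.6 | 1M | $0.3250 | $1.95 |
| 19 | Qwen: Qwen3.7 Plus | qwen | | 82 | 39 | 1M | $0.3200 | $1.28 |
| 20 | Google: Gemini 3 Flash Preview | google | | 82 | 37.8 | 1M | $0.5000 | $3.00 |
| 21 | OpenAI: o1-pro | openai | | 82 | 18.9 | 200K | $150.00 | $600.00 |
| 22 | Nemotron Cascade 2 30B A3B | NVIDIA | | 82 | 21.3 | | Free | Free |
| 23 | OpenAI: o3 Deep Research | openai | | 82 | 38.3 | 200K | $10.00 | $40.00 |
| 24 | OpenAI: GPT-4o-mini | openai | | 82 | 6.9 | 128K | $0.1500 | $0.6000 |
| 25 | xAI: Grok 4 | x-ai | | 82 | 33.3 | 256K | $3.00 | $15.00 |
| 26 | NVIDIA: Nemotron 3 Ultra | nvidia | New | 82 | 37.8 | 1M | $0.5000 | $2.20 |
| 27 | Google: Gemini 2.0 Flash Lite | google | | 82 | 12.3 | 1M | $0.0750 | $0.3000 |
| 28 | Qwen3.6 35B A3B (Non-reasoning) | Alibaba | | 82 | 24.2 | | $0.3750 | $2.25 |
| 29 | OpenAI: o4 Mini Deep Research | openai | | 82 | 33 | 200K | $2.00 | $8.00 |
| 30 | OpenAI: GPT-4o (2024-05-13) | openai | | 82 | 8.6 | 128K | $5.00 | $15.00 |
| 31 | Baidu: ERNIE 4.5 300B A47B | baidu | | 82 | 9 | 131K | $0.2800 | $1.10 |
| 32 | MoonshotAI: Kimi K2.6 | moonshotai | Updated | 82 | 42.8 | 262K | $0.6600 | $3.41 |
| 33 | Anthropic: Claude 3.7 Sonnet | anthropic | | 82 | 23.5 | 200K | $3.00 | $15.00 |
| 34 | Qwen: Qwen3.6 27B | qwen | | 82 | 37.1 | 262K | $0.2850 | $2.40 |
| 35 | Anthropic: Claude Sonnet 4.5 | anthropic | | 82 | 36.4 | 1M | $3.00 | $15.00 |
| 36 | OpenAI: GPT-4o | openai | | 82 | 11.2 | 128K | $2.50 | $10.00 |
| 37 | xAI: Grok 4.3 | x-ai | | 82 | 37.6 | 1M | $1.25 | $2.50 |
| 38 | OpenAI: GPT-5.3-Codex | openai | | 82 | 44.3 | 400K | $1.75 | $14.00 |
| 39 | Google: Gemini 2.5 Pro | google | | 82 | 25.8 | 1M | $1.25 | $10.00 |
| 40 | GPT-5.5 (Non-reasoning) | OpenAI | | 82 | 35.4 | | $5.00 | $30.00 |
| 41 | Nex AGI: Nex-N2-Pro | nex-agi | New, Best for Agents | 82 | 41 | 262K | $0.2500 | $1.00 |
| 42 | OpenAI: GPT-5.2 | openai | | 82 | 26 | 400K | $1.75 | $14.00 |
| 43 | Anthropic: Claude 3.7 Sonnet (thinking) | anthropic | | 82 | 27.1 | 200K | $3.00 | $15.00 |
| 44 | Qwen3.5 4B (Reasoning) | Alibaba | | 82 | 20.1 | | $0.0300 | $0.1500 |
| 45 | Google: Gemini 2.5 Flash Lite Preview 09-2025 | google | | 82 | 13.1 | 1M | $0.1000 | $0.4000 |
| 46 | OpenAI: o3 Pro | openai | | 82 | 32.5 | 200K | $20.00 | $80.00 |
| 47 | GPT-5.5 (medium) | OpenAI | | 82 | 50.4 | | $5.00 | $30.00 |
| 48 | Grok Build 0.1 0616 | xAI | New | 82 | 39.8 | | $1.00 | $2.00 |
| 49 | Anthropic: Claude Opus 4.5 | anthropic | | 82 | 40.8 | 200K | $5.00 | $25.00 |
| 50 | Google: Gemini 2.0 Flash | google | | 82 | 10.7 | 1M | $0.1000 | $0.4000 |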

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).
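The leaderboard describes value only as "capability per dollar" and does not publish a formula. As a rough illustration of one possible reading, the sketch below divides a model's AI Index by a blended token price. The function names, the use of AI Index as the capability measure, and the 75% input-token share are all assumptions made for this example, not the leaderboard's method.

```python
def blended_price_per_million(input_price: float, output_price: float,
                              input_share: float = 0.75) -> float:
    """Blend input and output prices (USD per 1M tokens).

    input_share is the assumed fraction of tokens that are input tokens;
    0.75 is an illustrative guess, not a figure from the leaderboard.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_price * input_share + output_price * (1.0 - input_share)


def index_points_per_dollar(ai_index: float, input_price: float,
                            output_price: float,
                            input_share: float = 0.75) -> float:
    """Hypothetical value metric: AI Index points per blended dollar."""
    blended = blended_price_per_million(input_price, output_price, input_share)
    if blended == 0:
        # Free models have no finite ratio under this definition.
        return float("inf")
    return ai_index / blended


# Figures taken from the table: (AI Index, input $/1M, output $/1M).
sonnet_5 = index_points_per_dollar(53.4, 2.00, 10.00)
fable_5 = index_points_per_dollar(59.9, 10.00, 50.00)
print(f"Claude Sonnet 5: {sonnet_5:.2f} index points per blended dollar")
print(f"Claude Fable 5:  {fable_5:.2f} index points per blended dollar")
```

Under these assumptions a cheaper model can come out well ahead on value despite a lower AI Index, which is the kind of trade-off a value dimension is meant to capture.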

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.
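The combination step can be sketched as a weighted mean of the four 0–10 grades, rescaled to 0–100. The weights below are hypothetical placeholders, since the actual weights are not published here, and `overall_score` is a name invented for this example.

```python
# Hypothetical weights: the dimensions are weighted, but the real values
# are not published, so these numbers are placeholders only.
ASSUMED_WEIGHTS = {
    "intelligence": 0.35,
    "technical": 0.25,
    "content": 0.20,
    "value": 0.20,
}


def overall_score(grades: dict[str, float],
                  weights: dict[str, float] = ASSUMED_WEIGHTS) -> float:
    """Combine 0 to 10 dimension grades into a 0 to 100 overall score."""
    if set(grades) != set(weights):
        raise ValueError("grades and weights must cover the same dimensions")
    for name, grade in grades.items():
        if not 0 <= grade <= 10:
            raise ValueError(f"{name} grade {grade} is outside the 0 to 10 range")
    total_weight = sum(weights.values())
    # Weighted mean on the 0 to 10 scale, normalised by the total weight.
    weighted_mean = sum(grades[d] * weights[d] for d in weights) / total_weight
    # Rescale to 0 to 100 and keep one decimal place, as in the table.
    return round(weighted_mean * 10, 1)


example = {"intelligence": 9, "technical": 8, "content": 7, "value": 6}
print(overall_score(example))
```

Dividing by the total weight keeps the result on the 0–100 scale even if the chosen weights do not sum exactly to 1.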

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.