General

Compare top AI models across all use cases.

Updated June 12, 2026

Compare top AI models across all use cases.

# Model Score AI Index Context Input / 1M Output / 1M Caps
1Anthropic: Claude Fable 5anthropic New Top Pick94.764.91M$10.00$50.00
2Anthropic: Claude Opus 4.8anthropic New Top Pick In-House Pick92.461.41M$5.00$25.00
3Google: Gemini 3.1 Pro Previewgoogle Best for Agents91.757.21M$2.00$12.00
4OpenAI: GPT-5.5openai Top Pick88.860.21.1M$5.00$30.00
5Anthropic: Claude Opus 4.7anthropic88.357.31M$5.00$25.00
6Anthropic: Claude Sonnet 4.6anthropic In-House Pick84.444.41M$3.00$15.00
7Qwen: Qwen3.7 Maxqwen New83.956.61M$1.25$3.75
8OpenAI: GPT-5.3-Codexopenai83.553.6400K$1.75$14.00
9Google: Gemini 3 Flash Previewgoogle82.5351M$0.5000$3.00
10Google: Gemini 2.5 Pro Preview 05-06google8229.51M$1.25$10.00
11OpenAI: o3 Deep Researchopenai8238.3200K$10.00$40.00
12Grok 4.20 0309 (Reasoning)xAI8248.5$2.00$6.00
13OpenAI: gpt-oss-20bopenai8220.8131K$0.0290$0.1400
14Xiaomi: MiMo-V2-Omnixiaomi8243.4262K$0.4000$2.00
15OpenAI: o3openai8238.4200K$2.00$8.00
16GPT-5.5 (high)OpenAI Best for Coding8258.9$5.00$30.00
17OpenAI: o4 Mini Deep Researchopenai8233200K$2.00$8.00
18Qwen2.5 MaxAlibaba8216.3$1.60$6.40
19Anthropic: Claude Opus 4.1anthropic8242200K$15.00$75.00
20Xiaomi: MiMo-V2-Proxiaomi8249.21M$1.00$3.00
21OpenAI: GPT-5.2openai8251.3400K$1.75$14.00
22OpenAI: o4 Miniopenai8233.1200K$1.10$4.40
23Muse SparkMeta8252.2FreeFree
24Anthropic: Claude Sonnet 4.5anthropic82431M$3.00$15.00
25Anthropic: Claude 3.5 Haikuanthropic8218.7200K$0.8000$4.00
26Qwen: Qwen3.5 397B A17Bqwen8240.1262K$0.3900$2.34
27xAI: Grok 4x-ai8241.5256K$3.00$15.00
28MiniMax: MiniMax M3minimax New8254.71M$0.3000$1.20
29Anthropic: Claude Opus 4.5anthropic8243.1200K$5.00$25.00
30OpenAI: GPT-4.1openai8226.31M$2.00$8.00
31Magistral Medium 1.2Mistral8227.1$2.00$5.00
32OpenAI: GPT-5 Codexopenai8244.6400K$1.25$10.00
33Anthropic: Claude 3.5 Sonnetanthropic8215.9200K$6.00$30.00
34inclusionAI: Ling-2.6-flashinclusionai8226.2262K$0.0100$0.0300
35Z.ai: GLM 5z-ai8249.8203K$0.6000$1.92
36Google: Gemini 2.5 Flashgoogle8220.61M$0.3000$2.50
37Qwen: Qwen3.6 Plusqwen82501M$0.3250$1.95
38Qwen: Qwen3.7 Plusqwen New8253.31M$0.3200$1.28
39Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google8248.466K$2.00$12.00
40OpenAI: o1-proopenai8225.8200K$150.00$600.00
41Nova 2.0 Omni (medium)Amazon8228$0.3000$2.50
42xAI: Grok Code Fast 1x-ai8228.7256K$0.2000$1.50
43OpenAI: GPT-4o (2024-08-06)openai8218.6128K$2.50$10.00
44xAI: Grok 4.3x-ai8248.81M$1.25$2.50
45Anthropic: Claude Opus 4.6anthropic8252.91M$5.00$25.00
46Google: Gemini 2.5 Progoogle8234.61M$1.25$10.00
47NVIDIA: Nemotron 3 Ultranvidia New8247.71M$0.5000$2.50
48xAI: Grok 4.1 Fastx-ai8223.62M$0.2000$0.5000
49Anthropic: Claude 3.7 Sonnetanthropic8230.8200K$3.00$15.00
50Nova 2.0 Lite (high)Amazon8234.5$0.3000$2.50
#1NewTop Pick94.7
Anthropic: Claude Fable 5anthropic
AI 64.91M ctx$10.00/M in
#2NewTop PickIn-House Pick92.4
Anthropic: Claude Opus 4.8anthropic
AI 61.41M ctx$5.00/M in
#3Best for Agents91.7
Google: Gemini 3.1 Pro Previewgoogle
AI 57.21M ctx$2.00/M in
#4Top Pick88.8
OpenAI: GPT-5.5openai
AI 60.21.1M ctx$5.00/M in
#588.3
Anthropic: Claude Opus 4.7anthropic
AI 57.31M ctx$5.00/M in
#6In-House Pick84.4
Anthropic: Claude Sonnet 4.6anthropic
AI 44.41M ctx$3.00/M in
#7New83.9
Qwen: Qwen3.7 Maxqwen
AI 56.61M ctx$1.25/M in
#883.5
OpenAI: GPT-5.3-Codexopenai
AI 53.6400K ctx$1.75/M in
#982.5
Google: Gemini 3 Flash Previewgoogle
AI 351M ctx$0.5000/M in
#1082
Google: Gemini 2.5 Pro Preview 05-06google
AI 29.51M ctx$1.25/M in
#1182
OpenAI: o3 Deep Researchopenai
AI 38.3200K ctx$10.00/M in
#1282
Grok 4.20 0309 (Reasoning)xAI
AI 48.5$2.00/M in
#1382
OpenAI: gpt-oss-20bopenai
AI 20.8131K ctx$0.0290/M in
#1482
Xiaomi: MiMo-V2-Omnixiaomi
AI 43.4262K ctx$0.4000/M in
#1582
OpenAI: o3openai
AI 38.4200K ctx$2.00/M in
#16Best for Coding82
GPT-5.5 (high)OpenAI
AI 58.9$5.00/M in
#1782
OpenAI: o4 Mini Deep Researchopenai
AI 33200K ctx$2.00/M in
#1882
Qwen2.5 MaxAlibaba
AI 16.3$1.60/M in
#1982
Anthropic: Claude Opus 4.1anthropic
AI 42200K ctx$15.00/M in
#2082
Xiaomi: MiMo-V2-Proxiaomi
AI 49.21M ctx$1.00/M in
#2182
OpenAI: GPT-5.2openai
AI 51.3400K ctx$1.75/M in
#2282
OpenAI: o4 Miniopenai
AI 33.1200K ctx$1.10/M in
#2382
Muse SparkMeta
AI 52.2Free/M in
#2482
Anthropic: Claude Sonnet 4.5anthropic
AI 431M ctx$3.00/M in
#2582
Anthropic: Claude 3.5 Haikuanthropic
AI 18.7200K ctx$0.8000/M in
#2682
Qwen: Qwen3.5 397B A17Bqwen
AI 40.1262K ctx$0.3900/M in
#2782
xAI: Grok 4x-ai
AI 41.5256K ctx$3.00/M in
#28New82
MiniMax: MiniMax M3minimax
AI 54.71M ctx$0.3000/M in
#2982
Anthropic: Claude Opus 4.5anthropic
AI 43.1200K ctx$5.00/M in
#3082
OpenAI: GPT-4.1openai
AI 26.31M ctx$2.00/M in
#3182
Magistral Medium 1.2Mistral
AI 27.1$2.00/M in
#3282
OpenAI: GPT-5 Codexopenai
AI 44.6400K ctx$1.25/M in
#3382
Anthropic: Claude 3.5 Sonnetanthropic
AI 15.9200K ctx$6.00/M in
#3482
inclusionAI: Ling-2.6-flashinclusionai
AI 26.2262K ctx$0.0100/M in
#3582
Z.ai: GLM 5z-ai
AI 49.8203K ctx$0.6000/M in
#3682
Google: Gemini 2.5 Flashgoogle
AI 20.61M ctx$0.3000/M in
#3782
Qwen: Qwen3.6 Plusqwen
AI 501M ctx$0.3250/M in
#38New82
Qwen: Qwen3.7 Plusqwen
AI 53.31M ctx$0.3200/M in
#3982
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
AI 48.466K ctx$2.00/M in
#4082
OpenAI: o1-proopenai
AI 25.8200K ctx$150.00/M in
#4182
Nova 2.0 Omni (medium)Amazon
AI 28$0.3000/M in
#4282
xAI: Grok Code Fast 1x-ai
AI 28.7256K ctx$0.2000/M in
#4382
OpenAI: GPT-4o (2024-08-06)openai
AI 18.6128K ctx$2.50/M in
#4482
xAI: Grok 4.3x-ai
AI 48.81M ctx$1.25/M in
#4582
Anthropic: Claude Opus 4.6anthropic
AI 52.91M ctx$5.00/M in
#4682
Google: Gemini 2.5 Progoogle
AI 34.61M ctx$1.25/M in
#47New82
NVIDIA: Nemotron 3 Ultranvidia
AI 47.71M ctx$0.5000/M in
#4882
xAI: Grok 4.1 Fastx-ai
AI 23.62M ctx$0.2000/M in
#4982
Anthropic: Claude 3.7 Sonnetanthropic
AI 30.8200K ctx$3.00/M in
#5082
Nova 2.0 Lite (high)Amazon
AI 34.5$0.3000/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 578 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.