Coding

Best models for code generation and debugging.

Updated June 12, 2026

Best models for code generation and debugging.

# Model Score AI Index Context Input / 1M Output / 1M Caps
1Anthropic: Claude Fable 5anthropic New Top Pick94.764.91M$10.00$50.00
2Anthropic: Claude Opus 4.8anthropic New Top Pick In-House Pick92.461.41M$5.00$25.00
3Google: Gemini 3.1 Pro Previewgoogle Best for Agents91.757.21M$2.00$12.00
4OpenAI: GPT-5.5openai Top Pick88.860.21.1M$5.00$30.00
5Anthropic: Claude Opus 4.7anthropic88.357.31M$5.00$25.00
6Anthropic: Claude Sonnet 4.6anthropic In-House Pick84.444.41M$3.00$15.00
7Qwen: Qwen3.7 Maxqwen New83.956.61M$1.25$3.75
8OpenAI: GPT-5.3-Codexopenai83.553.6400K$1.75$14.00
9Google: Gemini 3 Flash Previewgoogle82.5351M$0.5000$3.00
10Anthropic: Claude Opus 4.6anthropic8252.91M$5.00$25.00
11Qwen2.5 Coder 32B Instructqwen8212.9128K$0.6600$1.00
12Magistral Small 1Mistral8216.8FreeFree
13DeepSeek: DeepSeek V3.2deepseek8232.1131K$0.2288$0.3432
14Google: Gemini 2.5 Flashgoogle8220.61M$0.3000$2.50
15Xiaomi: MiMo-V2.5-Proxiaomi8253.81M$0.4350$0.8700
16GPT-5.5 Instant (May 2026)OpenAI8241.8$5.00$30.00
17Anthropic: Claude Sonnet 4.5anthropic82431M$3.00$15.00
18Devstral Small 2Mistral8219.5FreeFree
19Qwen: Qwen3 Coder Nextqwen8228.3262K$0.1100$0.8000
20Qwen: Qwen3 Coder 30B A3B Instructqwen8220160K$0.0700$0.2700
21Anthropic: Claude 3.5 Sonnetanthropic8215.9200K$6.00$30.00
22Magistral Medium 1Mistral8218.8FreeFree
23Anthropic: Claude Opus 4.5anthropic8243.1200K$5.00$25.00
24Google: Gemini 2.5 Progoogle8234.61M$1.25$10.00
25Xiaomi: MiMo-V2.5xiaomi82491M$0.1400$0.2800
26DeepSeek: DeepSeek V3.2 Expdeepseek8228.4164K$0.2700$0.4100
27DeepSeek: DeepSeek V3 0324deepseek8222.3164K$0.2000$0.7700
28Llama Nemotron Super 49B v1.5 (Reasoning)NVIDIA8218.7$0.1000$0.4000
29OpenAI: GPT-5.2-Codexopenai8249400K$1.75$14.00
30OpenAI: GPT-5.4openai8235.41.1M$2.50$15.00
31DeepSeek R1 Distill Qwen 14BDeepSeek8215.8FreeFree
32Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google8248.466K$2.00$12.00
33Google: Gemini 2.5 Pro Preview 06-05google8230.31M$1.25$10.00
34Tencent: Hy3 preview (free)tencent8241.9262KFreeFree
35Qwen: QwQ 32Bqwen8219.7131K$0.1500$0.5800
36Nemotron Cascade 2 30B A3BNVIDIA8228.4FreeFree
37Z.ai: GLM 4.7z-ai8234.2203K$0.4000$1.75
38DeepSeek-Coder-V2DeepSeek8210.6FreeFree
39xAI: Grok 4.1 Fastx-ai8223.62M$0.2000$0.5000
40Anthropic: Claude Opus 4anthropic8239200K$15.00$75.00
41DeepSeek: DeepSeek V4 Prodeepseek8251.51M$0.4350$0.8700
42Qwen: Qwen3.7 Plusqwen New8253.31M$0.3200$1.28
43Anthropic: Claude 3.7 Sonnetanthropic8230.8200K$3.00$15.00
44Hermes 4 – Llama-3.1 405B (Reasoning)Nous Research8218.6$1.00$3.00
45OpenAI: GPT-5.4 Nanoopenai8244400K$0.2000$1.25
46DeepSeek Coder V2 Lite InstructDeepSeek828.5FreeFree
47Google: Gemini 3 Pro Previewgoogle8241.31M$2.00$12.00
48Anthropic: Claude Sonnet 4anthropic82331M$3.00$15.00
49GPT-5.5 (Non-reasoning)OpenAI8240.9$5.00$30.00
50OpenAI: GPT-5 Codexopenai8244.6400K$1.25$10.00
#1NewTop Pick94.7
Anthropic: Claude Fable 5anthropic
AI 64.91M ctx$10.00/M in
#2NewTop PickIn-House Pick92.4
Anthropic: Claude Opus 4.8anthropic
AI 61.41M ctx$5.00/M in
#3Best for Agents91.7
Google: Gemini 3.1 Pro Previewgoogle
AI 57.21M ctx$2.00/M in
#4Top Pick88.8
OpenAI: GPT-5.5openai
AI 60.21.1M ctx$5.00/M in
#588.3
Anthropic: Claude Opus 4.7anthropic
AI 57.31M ctx$5.00/M in
#6In-House Pick84.4
Anthropic: Claude Sonnet 4.6anthropic
AI 44.41M ctx$3.00/M in
#7New83.9
Qwen: Qwen3.7 Maxqwen
AI 56.61M ctx$1.25/M in
#883.5
OpenAI: GPT-5.3-Codexopenai
AI 53.6400K ctx$1.75/M in
#982.5
Google: Gemini 3 Flash Previewgoogle
AI 351M ctx$0.5000/M in
#1082
Anthropic: Claude Opus 4.6anthropic
AI 52.91M ctx$5.00/M in
#1182
Qwen2.5 Coder 32B Instructqwen
AI 12.9128K ctx$0.6600/M in
#1282
Magistral Small 1Mistral
AI 16.8Free/M in
#1382
DeepSeek: DeepSeek V3.2deepseek
AI 32.1131K ctx$0.2288/M in
#1482
Google: Gemini 2.5 Flashgoogle
AI 20.61M ctx$0.3000/M in
#1582
Xiaomi: MiMo-V2.5-Proxiaomi
AI 53.81M ctx$0.4350/M in
#1682
GPT-5.5 Instant (May 2026)OpenAI
AI 41.8$5.00/M in
#1782
Anthropic: Claude Sonnet 4.5anthropic
AI 431M ctx$3.00/M in
#1882
Devstral Small 2Mistral
AI 19.5Free/M in
#1982
Qwen: Qwen3 Coder Nextqwen
AI 28.3262K ctx$0.1100/M in
#2082
Qwen: Qwen3 Coder 30B A3B Instructqwen
AI 20160K ctx$0.0700/M in
#2182
Anthropic: Claude 3.5 Sonnetanthropic
AI 15.9200K ctx$6.00/M in
#2282
Magistral Medium 1Mistral
AI 18.8Free/M in
#2382
Anthropic: Claude Opus 4.5anthropic
AI 43.1200K ctx$5.00/M in
#2482
Google: Gemini 2.5 Progoogle
AI 34.61M ctx$1.25/M in
#2582
Xiaomi: MiMo-V2.5xiaomi
AI 491M ctx$0.1400/M in
#2682
DeepSeek: DeepSeek V3.2 Expdeepseek
AI 28.4164K ctx$0.2700/M in
#2782
DeepSeek: DeepSeek V3 0324deepseek
AI 22.3164K ctx$0.2000/M in
#2882
Llama Nemotron Super 49B v1.5 (Reasoning)NVIDIA
AI 18.7$0.1000/M in
#2982
OpenAI: GPT-5.2-Codexopenai
AI 49400K ctx$1.75/M in
#3082
OpenAI: GPT-5.4openai
AI 35.41.1M ctx$2.50/M in
#3182
DeepSeek R1 Distill Qwen 14BDeepSeek
AI 15.8Free/M in
#3282
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
AI 48.466K ctx$2.00/M in
#3382
Google: Gemini 2.5 Pro Preview 06-05google
AI 30.31M ctx$1.25/M in
#3482
Tencent: Hy3 preview (free)tencent
AI 41.9262K ctxFree/M in
#3582
Qwen: QwQ 32Bqwen
AI 19.7131K ctx$0.1500/M in
#3682
Nemotron Cascade 2 30B A3BNVIDIA
AI 28.4Free/M in
#3782
Z.ai: GLM 4.7z-ai
AI 34.2203K ctx$0.4000/M in
#3882
DeepSeek-Coder-V2DeepSeek
AI 10.6Free/M in
#3982
xAI: Grok 4.1 Fastx-ai
AI 23.62M ctx$0.2000/M in
#4082
Anthropic: Claude Opus 4anthropic
AI 39200K ctx$15.00/M in
#4182
DeepSeek: DeepSeek V4 Prodeepseek
AI 51.51M ctx$0.4350/M in
#42New82
Qwen: Qwen3.7 Plusqwen
AI 53.31M ctx$0.3200/M in
#4382
Anthropic: Claude 3.7 Sonnetanthropic
AI 30.8200K ctx$3.00/M in
#4482
Hermes 4 – Llama-3.1 405B (Reasoning)Nous Research
AI 18.6$1.00/M in
#4582
OpenAI: GPT-5.4 Nanoopenai
AI 44400K ctx$0.2000/M in
#4682
DeepSeek Coder V2 Lite InstructDeepSeek
AI 8.5Free/M in
#4782
Google: Gemini 3 Pro Previewgoogle
AI 41.31M ctx$2.00/M in
#4882
Anthropic: Claude Sonnet 4anthropic
AI 331M ctx$3.00/M in
#4982
GPT-5.5 (Non-reasoning)OpenAI
AI 40.9$5.00/M in
#5082
OpenAI: GPT-5 Codexopenai
AI 44.6400K ctx$1.25/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 578 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.