Coding

Best models for code generation and debugging.

Updated July 4, 2026

Best models for code generation and debugging.

# Model Score AI Index Context Input / 1M Output / 1M Caps
1Anthropic: Claude Fable 5anthropic New Top Pick93.359.91M$10.00$50.00
2Anthropic: Claude Sonnet 5anthropic New In-House Pick9353.41M$2.00$10.00
3Anthropic: Claude Opus 4.8anthropic Top Pick In-House Pick92.355.71M$5.00$25.00
4Google: Gemini 3.1 Pro Previewgoogle89.846.51M$2.00$12.00
5Anthropic: Claude Opus 4.7anthropic89.453.51M$5.00$25.00
6Google: Gemini 3.5 Flashgoogle88.950.21M$1.50$9.00
7OpenAI: GPT-5.5openai Top Pick88.354.81.1M$5.00$30.00
8OpenAI: GPT-5.4openai8751.41.1M$2.50$15.00
9Z.ai: GLM 5.2z-ai New86.151.11M$0.7700$2.42
10Anthropic: Claude Sonnet 4.6anthropic85.147.21M$3.00$15.00
11Qwen: Qwen3.7 Maxqwen83.5461M$1.25$3.75
12Anthropic: Claude Opus 4.6anthropic8337.81M$5.00$25.00
13DeepSeek: DeepSeek V4 Prodeepseek82.144.31M$0.4350$0.8700
14Qwen: Qwen3 Coder Nextqwen8221.2262K$0.1100$0.8000
15Anthropic: Claude Sonnet 4anthropic8228.91M$3.00$15.00
16Kwaipilot: KAT-Coder-Pro V2kwaipilot8235.4256K$0.3000$1.20
17Qwen3 Coder 480B A35B InstructAlibaba8218$1.50$7.50
18Prime Intellect: INTELLECT-3prime-intellect8215.6131K$0.2000$1.10
19OpenAI: gpt-oss-20bopenai8214.9131K$0.0290$0.1400
20OpenAI: GPT-4.1 Miniopenai8214.81M$0.4000$1.60
21Anthropic: Claude Haiku 4.5anthropic8229.6200K$1.00$5.00
22Qwen: Qwen3 235B A22B Instruct 2507qwen8219.6262K$0.0900$0.1000
23Anthropic: Claude 3.5 Haikuanthropic8212.3200K$0.8000$4.00
24o1-previewOpenAI8217$16.50$66.00
25OpenAI: GPT-5.2-Codexopenai8240.1400K$1.75$14.00
26Qwen: Qwen3 Next 80B A3B Thinkingqwen8216.7262K$0.0975$0.7800
27Google: Gemini 2.5 Pro Preview 05-06google8229.51M$1.25$10.00
28Z.ai: GLM 5.1z-ai8240.2203K$0.9660$3.04
29Qwen3 4B 2507 (Reasoning)Alibaba8212FreeFree
30Anthropic: Claude Opus 4.5anthropic8234.7200K$5.00$25.00
31Anthropic: Claude Opus 4.1anthropic8233.7200K$15.00$75.00
32GPT-5.5 (high)OpenAI Best for Coding8253.1$5.00$30.00
33MiniMax: MiniMax M3minimax8244.41M$0.3000$1.20
34NVIDIA: Llama 3.3 Nemotron Super 49B V1.5nvidia8212.2131K$0.4000$0.4000
35MoonshotAI: Kimi K2 0711moonshotai8219.4131K$0.5700$2.30
36OpenAI: GPT-4 Turboopenai827.9128K$10.00$30.00
37o1-miniOpenAI8214FreeFree
38Google: Gemini 3 Flash Previewgoogle8237.81M$0.5000$3.00
39Qwen: Qwen3 Next 80B A3B Instruct (free)qwen8220.1262KFreeFree
40Qwen: Qwen3.6 Plusqwen8239.61M$0.3250$1.95
41Qwen3 4B (Reasoning)Alibaba828.4$0.1100$1.26
42Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google8239.666K$2.00$12.00
43DeepSeek: DeepSeek V3 0324deepseek8215.4164K$0.2400$0.9000
44Muse SparkMeta8243.1FreeFree
45Qwen: Qwen3.7 Plusqwen82391M$0.3200$1.28
46Z.ai: GLM 4.6z-ai8225.1203K$0.4300$1.74
47Mistral: Devstral Mediummistralai8212.4131K$0.4000$2.00
48Qwen: Qwen3.5-122B-A10Bqwen8232.3262K$0.2600$2.08
49Magistral Small 1Mistral8210.7FreeFree
50Xiaomi: MiMo-V2-Flashxiaomi8224.7262K$0.1000$0.3000
#1NewTop Pick93.3
Anthropic: Claude Fable 5anthropic
AI 59.91M ctx$10.00/M in
#2NewIn-House Pick93
Anthropic: Claude Sonnet 5anthropic
AI 53.41M ctx$2.00/M in
#3Top PickIn-House Pick92.3
Anthropic: Claude Opus 4.8anthropic
AI 55.71M ctx$5.00/M in
#489.8
Google: Gemini 3.1 Pro Previewgoogle
AI 46.51M ctx$2.00/M in
#589.4
Anthropic: Claude Opus 4.7anthropic
AI 53.51M ctx$5.00/M in
#688.9
Google: Gemini 3.5 Flashgoogle
AI 50.21M ctx$1.50/M in
#7Top Pick88.3
OpenAI: GPT-5.5openai
AI 54.81.1M ctx$5.00/M in
#887
OpenAI: GPT-5.4openai
AI 51.41.1M ctx$2.50/M in
#9New86.1
Z.ai: GLM 5.2z-ai
AI 51.11M ctx$0.7700/M in
#1085.1
Anthropic: Claude Sonnet 4.6anthropic
AI 47.21M ctx$3.00/M in
#1183.5
Qwen: Qwen3.7 Maxqwen
AI 461M ctx$1.25/M in
#1283
Anthropic: Claude Opus 4.6anthropic
AI 37.81M ctx$5.00/M in
#1382.1
DeepSeek: DeepSeek V4 Prodeepseek
AI 44.31M ctx$0.4350/M in
#1482
Qwen: Qwen3 Coder Nextqwen
AI 21.2262K ctx$0.1100/M in
#1582
Anthropic: Claude Sonnet 4anthropic
AI 28.91M ctx$3.00/M in
#1682
Kwaipilot: KAT-Coder-Pro V2kwaipilot
AI 35.4256K ctx$0.3000/M in
#1782
Qwen3 Coder 480B A35B InstructAlibaba
AI 18$1.50/M in
#1882
Prime Intellect: INTELLECT-3prime-intellect
AI 15.6131K ctx$0.2000/M in
#1982
OpenAI: gpt-oss-20bopenai
AI 14.9131K ctx$0.0290/M in
#2082
OpenAI: GPT-4.1 Miniopenai
AI 14.81M ctx$0.4000/M in
#2182
Anthropic: Claude Haiku 4.5anthropic
AI 29.6200K ctx$1.00/M in
#2282
Qwen: Qwen3 235B A22B Instruct 2507qwen
AI 19.6262K ctx$0.0900/M in
#2382
Anthropic: Claude 3.5 Haikuanthropic
AI 12.3200K ctx$0.8000/M in
#2482
o1-previewOpenAI
AI 17$16.50/M in
#2582
OpenAI: GPT-5.2-Codexopenai
AI 40.1400K ctx$1.75/M in
#2682
Qwen: Qwen3 Next 80B A3B Thinkingqwen
AI 16.7262K ctx$0.0975/M in
#2782
Google: Gemini 2.5 Pro Preview 05-06google
AI 29.51M ctx$1.25/M in
#2882
Z.ai: GLM 5.1z-ai
AI 40.2203K ctx$0.9660/M in
#2982
Qwen3 4B 2507 (Reasoning)Alibaba
AI 12Free/M in
#3082
Anthropic: Claude Opus 4.5anthropic
AI 34.7200K ctx$5.00/M in
#3182
Anthropic: Claude Opus 4.1anthropic
AI 33.7200K ctx$15.00/M in
#32Best for Coding82
GPT-5.5 (high)OpenAI
AI 53.1$5.00/M in
#3382
MiniMax: MiniMax M3minimax
AI 44.41M ctx$0.3000/M in
#3482
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5nvidia
AI 12.2131K ctx$0.4000/M in
#3582
MoonshotAI: Kimi K2 0711moonshotai
AI 19.4131K ctx$0.5700/M in
#3682
OpenAI: GPT-4 Turboopenai
AI 7.9128K ctx$10.00/M in
#3782
o1-miniOpenAI
AI 14Free/M in
#3882
Google: Gemini 3 Flash Previewgoogle
AI 37.81M ctx$0.5000/M in
#3982
Qwen: Qwen3 Next 80B A3B Instruct (free)qwen
AI 20.1262K ctxFree/M in
#4082
Qwen: Qwen3.6 Plusqwen
AI 39.61M ctx$0.3250/M in
#4182
Qwen3 4B (Reasoning)Alibaba
AI 8.4$0.1100/M in
#4282
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
AI 39.666K ctx$2.00/M in
#4382
DeepSeek: DeepSeek V3 0324deepseek
AI 15.4164K ctx$0.2400/M in
#4482
Muse SparkMeta
AI 43.1Free/M in
#4582
Qwen: Qwen3.7 Plusqwen
AI 391M ctx$0.3200/M in
#4682
Z.ai: GLM 4.6z-ai
AI 25.1203K ctx$0.4300/M in
#4782
Mistral: Devstral Mediummistralai
AI 12.4131K ctx$0.4000/M in
#4882
Qwen: Qwen3.5-122B-A10Bqwen
AI 32.3262K ctx$0.2600/M in
#4982
Magistral Small 1Mistral
AI 10.7Free/M in
#5082
Xiaomi: MiMo-V2-Flashxiaomi
AI 24.7262K ctx$0.1000/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.