Coding

Best models for code generation and debugging.

Updated July 4, 2026

157 Models tracked

32 Providers

Daily Data refresh

Leading right now Anthropic: Claude Fable 5 1M context · $10.00/1M in 93.3 Score

Best models for code generation and debugging.

#	Model	Score	AI Index	Context	Input / 1M	Output / 1M
1	Anthropic: Claude Fable 5anthropic New Top Pick	93.3	59.9	1M	$10.00	$50.00
2	Anthropic: Claude Sonnet 5anthropic New In-House Pick	93	53.4	1M	$2.00	$10.00
3	Anthropic: Claude Opus 4.8anthropic Top Pick In-House Pick	92.3	55.7	1M	$5.00	$25.00
4	Google: Gemini 3.1 Pro Previewgoogle	89.8	46.5	1M	$2.00	$12.00
5	Anthropic: Claude Opus 4.7anthropic	89.4	53.5	1M	$5.00	$25.00
6	Google: Gemini 3.5 Flashgoogle	88.9	50.2	1M	$1.50	$9.00
7	OpenAI: GPT-5.5openai Top Pick	88.3	54.8	1.1M	$5.00	$30.00
8	OpenAI: GPT-5.4openai	87	51.4	1.1M	$2.50	$15.00
9	Z.ai: GLM 5.2z-ai New	86.1	51.1	1M	$0.7700	$2.42
10	Anthropic: Claude Sonnet 4.6anthropic	85.1	47.2	1M	$3.00	$15.00
11	Qwen: Qwen3.7 Maxqwen	83.5	46	1M	$1.25	$3.75
12	Anthropic: Claude Opus 4.6anthropic	83	37.8	1M	$5.00	$25.00
13	DeepSeek: DeepSeek V4 Prodeepseek	82.1	44.3	1M	$0.4350	$0.8700
14	xAI: Grok 4x-ai	82	33.3	256K	$3.00	$15.00
15	Inception: Mercury 2inception	82	25.3	128K	$0.2500	$0.7500
16	DeepSeek R1 Distill Qwen 14BDeepSeek	82	9.8		Free	Free
17	MoonshotAI: Kimi K2 0905moonshotai	82	23.5	262K	$0.6000	$2.50
18	Qwen: Qwen3 32Bqwen	82	11.5	131K	$0.0800	$0.2800
19	Xiaomi: MiMo-V2.5-Proxiaomi	82	42.2	1M	$0.4350	$0.8700
20	Mistral: Mistral Medium 3.5mistralai	82	29.9	262K	$1.50	$7.50
21	Z.ai: GLM 4.5z-ai	82	19.5	131K	$0.6000	$2.20
22	OpenAI: o3 Mini Highopenai	82	15.6	200K	$1.10	$4.40
23	Hermes 4 – Llama-3.1 405B (Reasoning)Nous Research	82	9		$1.00	$3.00
24	MoonshotAI: Kimi K2.7 Codemoonshotai New	82	41.9	262K	$0.7400	$3.50
25	Qwen: Qwen3 Maxqwen	82	24	262K	$0.7800	$3.90
26	TNG: DeepSeek R1T2 Chimeratngtech	82	27	164K	$0.3000	$1.10
27	DeepSeek-Coder-V2DeepSeek	82	5.1		Free	Free
28	OpenAI: GPT-5.2openai	82	26	400K	$1.75	$14.00
29	Qwen: Qwen3 30B A3B Thinking 2507qwen	82	14.4	131K	$0.1300	$1.56
30	Qwen: Qwen3 235B A22Bqwen	82	13.4	131K	$0.4550	$1.82
31	OpenAI: GPT-5.1-Codexopenai	82	34.7	400K	$1.25	$10.00
32	Z.ai: GLM 4.5 Airz-ai	82	16.5	131K	$0.1300	$0.8500
33	OpenAI: o3 Miniopenai	82	19	200K	$1.10	$4.40
34	Hermes 4 – Llama-3.1 70B (Reasoning)Nous Research	82	10		$0.1300	$0.4000
35	Qwen: Qwen3.5 397B A17Bqwen	82	33.7	256K	$0.3850	$2.45
36	Google: Gemini 2.5 Progoogle	82	25.8	1M	$1.25	$10.00
37	OpenAI: GPT-5.4 Nanoopenai	82	38.2	400K	$0.2000	$1.25
38	DeepSeek R1 Distill Llama 8BDeepSeek	82	6.4		Free	Free
39	Mistral: Devstral 2 2512mistralai	82	19.2	262K	$0.4000	$2.00
40	xAI: Grok Code Fast 1x-ai	82	21.6	256K	$0.2000	$1.50
41	OpenAI: o3openai	82	30.4	200K	$2.00	$8.00
42	DeepSeek: DeepSeek V4 Flashdeepseek Best Value	82	40.3	1M	$0.0900	$0.1800
43	inclusionAI: Ring-2.6-1Tinclusionai	82	30.6	262K	$0.0750	$0.6250
44	OpenAI: GPT-5.1-Codex-Miniopenai	82	30.6	400K	$0.2500	$2.00
45	DeepSeek: R1deepseek	82	18.5	164K	$0.7000	$2.50
46	ERNIE 5.0 Thinking PreviewBaidu	82	21.9		Free	Free
47	Nex AGI: Nex-N2-Pronex-agi New Best for Agents	82	41	262K	$0.2500	$1.00
48	Z.ai: GLM 5z-ai	82	39.5	203K	$0.6000	$1.92
49	OpenAI: GPT-5 Codexopenai	82	36.1	400K	$1.25	$10.00
50	xAI: Grok 3 Minix-ai	82	22.5	131K	$0.3000	$0.5000

Anthropic: Claude Fable 5anthropic

AI 59.91M ctx$10.00/M in

Anthropic: Claude Sonnet 5anthropic

AI 53.41M ctx$2.00/M in

Anthropic: Claude Opus 4.8anthropic

AI 55.71M ctx$5.00/M in

Google: Gemini 3.1 Pro Previewgoogle

AI 46.51M ctx$2.00/M in

Anthropic: Claude Opus 4.7anthropic

AI 53.51M ctx$5.00/M in

Google: Gemini 3.5 Flashgoogle

AI 50.21M ctx$1.50/M in

OpenAI: GPT-5.5openai

AI 54.81.1M ctx$5.00/M in

OpenAI: GPT-5.4openai

AI 51.41.1M ctx$2.50/M in

Z.ai: GLM 5.2z-ai

AI 51.11M ctx$0.7700/M in

Anthropic: Claude Sonnet 4.6anthropic

AI 47.21M ctx$3.00/M in

Qwen: Qwen3.7 Maxqwen

AI 461M ctx$1.25/M in

Anthropic: Claude Opus 4.6anthropic

AI 37.81M ctx$5.00/M in

DeepSeek: DeepSeek V4 Prodeepseek

AI 44.31M ctx$0.4350/M in

xAI: Grok 4x-ai

AI 33.3256K ctx$3.00/M in

Inception: Mercury 2inception

AI 25.3128K ctx$0.2500/M in

DeepSeek R1 Distill Qwen 14BDeepSeek

AI 9.8Free/M in

MoonshotAI: Kimi K2 0905moonshotai

AI 23.5262K ctx$0.6000/M in

Qwen: Qwen3 32Bqwen

AI 11.5131K ctx$0.0800/M in

Xiaomi: MiMo-V2.5-Proxiaomi

AI 42.21M ctx$0.4350/M in

Mistral: Mistral Medium 3.5mistralai

AI 29.9262K ctx$1.50/M in

Z.ai: GLM 4.5z-ai

AI 19.5131K ctx$0.6000/M in

OpenAI: o3 Mini Highopenai

AI 15.6200K ctx$1.10/M in

Hermes 4 – Llama-3.1 405B (Reasoning)Nous Research

AI 9$1.00/M in

MoonshotAI: Kimi K2.7 Codemoonshotai

AI 41.9262K ctx$0.7400/M in

Qwen: Qwen3 Maxqwen

AI 24262K ctx$0.7800/M in

TNG: DeepSeek R1T2 Chimeratngtech

AI 27164K ctx$0.3000/M in

DeepSeek-Coder-V2DeepSeek

AI 5.1Free/M in

OpenAI: GPT-5.2openai

AI 26400K ctx$1.75/M in

Qwen: Qwen3 30B A3B Thinking 2507qwen

AI 14.4131K ctx$0.1300/M in

Qwen: Qwen3 235B A22Bqwen

AI 13.4131K ctx$0.4550/M in

OpenAI: GPT-5.1-Codexopenai

AI 34.7400K ctx$1.25/M in

Z.ai: GLM 4.5 Airz-ai

AI 16.5131K ctx$0.1300/M in

OpenAI: o3 Miniopenai

AI 19200K ctx$1.10/M in

Hermes 4 – Llama-3.1 70B (Reasoning)Nous Research

AI 10$0.1300/M in

Qwen: Qwen3.5 397B A17Bqwen

AI 33.7256K ctx$0.3850/M in

Google: Gemini 2.5 Progoogle

AI 25.81M ctx$1.25/M in

OpenAI: GPT-5.4 Nanoopenai

AI 38.2400K ctx$0.2000/M in

DeepSeek R1 Distill Llama 8BDeepSeek

AI 6.4Free/M in

Mistral: Devstral 2 2512mistralai

AI 19.2262K ctx$0.4000/M in

xAI: Grok Code Fast 1x-ai

AI 21.6256K ctx$0.2000/M in

OpenAI: o3openai

AI 30.4200K ctx$2.00/M in

DeepSeek: DeepSeek V4 Flashdeepseek

AI 40.31M ctx$0.0900/M in

inclusionAI: Ring-2.6-1Tinclusionai

AI 30.6262K ctx$0.0750/M in

OpenAI: GPT-5.1-Codex-Miniopenai

AI 30.6400K ctx$0.2500/M in

DeepSeek: R1deepseek

AI 18.5164K ctx$0.7000/M in

ERNIE 5.0 Thinking PreviewBaidu

AI 21.9Free/M in

Nex AGI: Nex-N2-Pronex-agi

AI 41262K ctx$0.2500/M in

Z.ai: GLM 5z-ai

AI 39.5203K ctx$0.6000/M in

OpenAI: GPT-5 Codexopenai

AI 36.1400K ctx$1.25/M in

xAI: Grok 3 Minix-ai

AI 22.5131K ctx$0.3000/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.