AI Agents

Models optimised for autonomous agent workflows.

Updated June 12, 2026

99 Models tracked

22 Providers

Daily Data refresh

Leading right now Anthropic: Claude Fable 5 1M context · $10.00/1M in 94.7 Score

Models optimised for autonomous agent workflows.

#	Model	Score	AI Index	Context	Input / 1M	Output / 1M
1	Anthropic: Claude Fable 5anthropic New Top Pick	94.7	64.9	1M	$10.00	$50.00
2	Anthropic: Claude Opus 4.8anthropic New Top Pick In-House Pick	92.4	61.4	1M	$5.00	$25.00
3	Google: Gemini 3.1 Pro Previewgoogle Best for Agents	91.7	57.2	1M	$2.00	$12.00
4	OpenAI: GPT-5.5openai Top Pick	88.8	60.2	1.1M	$5.00	$30.00
5	Anthropic: Claude Opus 4.7anthropic	88.3	57.3	1M	$5.00	$25.00
6	Anthropic: Claude Sonnet 4.6anthropic In-House Pick	84.4	44.4	1M	$3.00	$15.00
7	Qwen: Qwen3.7 Maxqwen New	83.9	56.6	1M	$1.25	$3.75
8	OpenAI: GPT-5.3-Codexopenai	83.5	53.6	400K	$1.75	$14.00
9	Google: Gemini 3 Flash Previewgoogle	82.5	35	1M	$0.5000	$3.00
10	xAI: Grok Code Fast 1x-ai	82	28.7	256K	$0.2000	$1.50
11	MiniMax: MiniMax M3minimax New	82	54.7	1M	$0.3000	$1.20
12	Z.ai: GLM 4.7z-ai	82	34.2	203K	$0.4000	$1.75
13	Anthropic: Claude Sonnet 4anthropic	82	33	1M	$3.00	$15.00
14	OpenAI: GPT-5.1openai	82	47.7	400K	$1.25	$10.00
15	Z.ai: GLM 5 Turboz-ai	82	46.8	262K	$1.20	$4.00
16	Mistral: Mistral Medium 3.5mistralai	82	39.2	262K	$1.50	$7.50
17	OpenAI: GPT-5openai	82	44.6	400K	$1.25	$10.00
18	Qwen: Qwen3.6 Plusqwen	82	50	1M	$0.3250	$1.95
19	Qwen: Qwen3.7 Plusqwen New	82	53.3	1M	$0.3200	$1.28
20	Google: Gemini 2.5 Pro Preview 05-06google	82	29.5	1M	$1.25	$10.00
21	GPT-5.5 (high)OpenAI Best for Coding	82	58.9		$5.00	$30.00
22	OpenAI: GPT-5.1-Codexopenai	82	43.1	400K	$1.25	$10.00
23	OpenAI: GPT-5.4 Nanoopenai	82	44	400K	$0.2000	$1.25
24	inclusionAI: Ling-2.6-1Tinclusionai	82	33.6	262K	$0.0750	$0.6250
25	OpenAI: GPT-5 Miniopenai	82	41.2	400K	$0.2500	$2.00
26	NVIDIA: Nemotron 3 Ultranvidia New	82	47.7	1M	$0.5000	$2.50
27	Xiaomi: MiMo-V2-Flashxiaomi	82	30.3	262K	$0.1000	$0.3000
28	OpenAI: o3openai	82	38.4	200K	$2.00	$8.00
29	Muse SparkMeta	82	52.2		Free	Free
30	MoonshotAI: Kimi K2 Thinkingmoonshotai	82	40.9	262K	$0.6000	$2.50
31	MiniMax: MiniMax M2.7minimax Updated	82	49.6	205K	$0.2500	$1.00
32	OpenAI: gpt-oss-20bopenai	82	20.8	131K	$0.0290	$0.1400
33	MoonshotAI: Kimi K2.6moonshotai Updated	82	53.9	262K	$0.6800	$3.41
34	OpenAI: o4 Miniopenai	82	33.1	200K	$1.10	$4.40
35	Nova 2.0 Lite (high)Amazon	82	34.5		$0.3000	$2.50
36	MiniMax: MiniMax M2minimax	82	36.1	205K	$0.2550	$1.00
37	Xiaomi: MiMo-V2-Omnixiaomi	82	43.4	262K	$0.4000	$2.00
38	inclusionAI: Ring-2.6-1Tinclusionai	82	38.5	262K	$0.0750	$0.6250
39	Qwen: Qwen3.5 397B A17Bqwen	82	40.1	262K	$0.3900	$2.34
40	Anthropic: Claude Opus 4.1anthropic	82	42	200K	$15.00	$75.00
41	Xiaomi: MiMo-V2.5xiaomi	82	49	1M	$0.1400	$0.2800
42	Anthropic: Claude 3.7 Sonnetanthropic	82	30.8	200K	$3.00	$15.00
43	Nova 2.0 Lite (medium)Amazon	82	29.7		$0.3000	$2.50
44	OpenAI: o3 Deep Researchopenai	82	38.3	200K	$10.00	$40.00
45	Xiaomi: MiMo-V2-Proxiaomi	82	49.2	1M	$1.00	$3.00
46	Google: Gemini 3.5 Flashgoogle New	82	54.8	1M	$1.50	$9.00
47	MiniMax: MiniMax M2.5minimax	82	41.9	205K	$0.1500	$0.9000
48	MoonshotAI: Kimi K2 0711moonshotai	82	26.3	131K	$0.5700	$2.30
49	Tencent: Hy3 preview (free)tencent	82	41.9	262K	Free	Free
50	OpenAI: GPT-5.2openai	82	51.3	400K	$1.75	$14.00

Anthropic: Claude Fable 5anthropic

AI 64.91M ctx$10.00/M in

Anthropic: Claude Opus 4.8anthropic

AI 61.41M ctx$5.00/M in

Google: Gemini 3.1 Pro Previewgoogle

AI 57.21M ctx$2.00/M in

OpenAI: GPT-5.5openai

AI 60.21.1M ctx$5.00/M in

Anthropic: Claude Opus 4.7anthropic

AI 57.31M ctx$5.00/M in

Anthropic: Claude Sonnet 4.6anthropic

AI 44.41M ctx$3.00/M in

Qwen: Qwen3.7 Maxqwen

AI 56.61M ctx$1.25/M in

OpenAI: GPT-5.3-Codexopenai

AI 53.6400K ctx$1.75/M in

Google: Gemini 3 Flash Previewgoogle

AI 351M ctx$0.5000/M in

xAI: Grok Code Fast 1x-ai

AI 28.7256K ctx$0.2000/M in

MiniMax: MiniMax M3minimax

AI 54.71M ctx$0.3000/M in

Z.ai: GLM 4.7z-ai

AI 34.2203K ctx$0.4000/M in

Anthropic: Claude Sonnet 4anthropic

AI 331M ctx$3.00/M in

OpenAI: GPT-5.1openai

AI 47.7400K ctx$1.25/M in

Z.ai: GLM 5 Turboz-ai

AI 46.8262K ctx$1.20/M in

Mistral: Mistral Medium 3.5mistralai

AI 39.2262K ctx$1.50/M in

OpenAI: GPT-5openai

AI 44.6400K ctx$1.25/M in

Qwen: Qwen3.6 Plusqwen

AI 501M ctx$0.3250/M in

Qwen: Qwen3.7 Plusqwen

AI 53.31M ctx$0.3200/M in

Google: Gemini 2.5 Pro Preview 05-06google

AI 29.51M ctx$1.25/M in

GPT-5.5 (high)OpenAI

AI 58.9$5.00/M in

OpenAI: GPT-5.1-Codexopenai

AI 43.1400K ctx$1.25/M in

OpenAI: GPT-5.4 Nanoopenai

AI 44400K ctx$0.2000/M in

inclusionAI: Ling-2.6-1Tinclusionai

AI 33.6262K ctx$0.0750/M in

OpenAI: GPT-5 Miniopenai

AI 41.2400K ctx$0.2500/M in

NVIDIA: Nemotron 3 Ultranvidia

AI 47.71M ctx$0.5000/M in

Xiaomi: MiMo-V2-Flashxiaomi

AI 30.3262K ctx$0.1000/M in

OpenAI: o3openai

AI 38.4200K ctx$2.00/M in

Muse SparkMeta

AI 52.2Free/M in

MoonshotAI: Kimi K2 Thinkingmoonshotai

AI 40.9262K ctx$0.6000/M in

MiniMax: MiniMax M2.7minimax

AI 49.6205K ctx$0.2500/M in

OpenAI: gpt-oss-20bopenai

AI 20.8131K ctx$0.0290/M in

MoonshotAI: Kimi K2.6moonshotai

AI 53.9262K ctx$0.6800/M in

OpenAI: o4 Miniopenai

AI 33.1200K ctx$1.10/M in

Nova 2.0 Lite (high)Amazon

AI 34.5$0.3000/M in

MiniMax: MiniMax M2minimax

AI 36.1205K ctx$0.2550/M in

Xiaomi: MiMo-V2-Omnixiaomi

AI 43.4262K ctx$0.4000/M in

inclusionAI: Ring-2.6-1Tinclusionai

AI 38.5262K ctx$0.0750/M in

Qwen: Qwen3.5 397B A17Bqwen

AI 40.1262K ctx$0.3900/M in

Anthropic: Claude Opus 4.1anthropic

AI 42200K ctx$15.00/M in

Xiaomi: MiMo-V2.5xiaomi

AI 491M ctx$0.1400/M in

Anthropic: Claude 3.7 Sonnetanthropic

AI 30.8200K ctx$3.00/M in

Nova 2.0 Lite (medium)Amazon

AI 29.7$0.3000/M in

OpenAI: o3 Deep Researchopenai

AI 38.3200K ctx$10.00/M in

Xiaomi: MiMo-V2-Proxiaomi

AI 49.21M ctx$1.00/M in

Google: Gemini 3.5 Flashgoogle

AI 54.81M ctx$1.50/M in

MiniMax: MiniMax M2.5minimax

AI 41.9205K ctx$0.1500/M in

MoonshotAI: Kimi K2 0711moonshotai

AI 26.3131K ctx$0.5700/M in

Tencent: Hy3 preview (free)tencent

AI 41.9262K ctxFree/M in

OpenAI: GPT-5.2openai

AI 51.3400K ctx$1.75/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 578 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.