AI Agents

Models optimised for autonomous agent workflows.

Updated May 20, 2026
# Model Score AI Index Context Input / 1M Output / 1M Caps
1Google: Gemini 3.1 Pro Previewgoogle Top Pick95.357.21M$2.00$12.00
2Anthropic: Claude Opus 4.7anthropic In-House Pick9457.31M$5.00$25.00
3OpenAI: GPT-5.5openai New Top Pick93.660.21.1M$5.00$30.00
4Anthropic: Claude Opus 4.6anthropic93.446.51M$5.00$25.00
5OpenAI: GPT-5.3-Codexopenai89.853.6400K$1.75$14.00
6OpenAI: GPT-5.4 Miniopenai87.148.9400K$0.7500$4.50
7GPT-5.5 (high)OpenAI New Top Pick86.758.9$5.00$30.00
8MoonshotAI: Kimi K2.6moonshotai86.453.9262K$0.7300$3.49
9GPT-5.5 (medium)OpenAI New Best for Coding85.756.7$5.00$30.00
10Google: Gemini 3 Flash Previewgoogle84.946.41M$0.5000$3.00
11Anthropic: Claude Sonnet 4.6anthropic In-House Pick84.944.41M$3.00$15.00
12Qwen: Qwen3.6 Plusqwen Best for Agents84.8501M$0.3250$1.95
13DeepSeek: DeepSeek V4 Prodeepseek New84.249.81M$0.4350$0.8700
14xAI: Grok 4.20 Multi-Agent Betax-ai82.748.52M$2.00$6.00
15Anthropic: Claude Opus 4.5anthropic82.549.7200K$5.00$25.00
16OpenAI: GPT-5.2-Codexopenai82.549400K$1.75$14.00
17Xiaomi: MiMo-V2.5xiaomi New82.5491M$0.4000$2.00
18DeepSeek: DeepSeek V4 Flashdeepseek New82.2461M$0.1120$0.2240
19Qwen: Qwen3.6 Max Previewqwen New82.151.8262K$1.04$6.24
20Anthropic: Claude Sonnet 4.5anthropic82.1431M$3.00$15.00
21GPT-5.5 (low)OpenAI New8250.8$5.00$30.00
22Z.ai: GLM 5.1z-ai8243.8203KFreeFree
23Muse SparkMeta8252.2FreeFree
24OpenAI: GPT-5.4 Nanoopenai8238.1400K$0.2000$1.25
25Xiaomi: MiMo-V2-Proxiaomi81.549.21M$1.00$3.00
26OpenAI: GPT-5.2openai81.151.3400K$1.75$14.00
27Z.ai: GLM 5z-ai80.249.8203K$0.6000$1.92
28Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google79.648.466K$2.00$12.00
29MiniMax: MiniMax M2.7minimax79.649.6205K$0.2790$1.20
30OpenAI: GPT-5 Codexopenai79.144.6400K$1.25$10.00
31Kwaipilot: KAT-Coder-Pro V2kwaipilot77.243.8256K$0.3000$1.20
32Google: Gemini 2.5 Pro Preview 06-05google77.130.31M$1.25$10.00
33Anthropic: Claude Opus 4.1anthropic76.942200K$15.00$75.00
34Google: Gemini 3 Pro Previewgoogle76.841.31M$2.00$12.00
35Qwen: Qwen3.6 35B A3Bqwen New76.843.5262K$0.1490$1.00
36Qwen: Qwen3.6 27Bqwen New76.545.8262K$0.3200$3.20
37OpenAI: GPT-5.1-Codexopenai76.543.1400K$1.25$10.00
38OpenAI: GPT-5 Miniopenai76.341.2400K$0.2500$2.00
39OpenAI: o3openai75.738.4200K$2.00$8.00
40Grok 4.20 0309 (Reasoning)xAI75.548.5$2.00$6.00
41MiniMax: MiniMax M2.5minimax75.341.9205K$0.1500$1.15
42Anthropic: Claude Opus 4anthropic74.939200K$15.00$75.00
43Anthropic: Claude Haiku 4.5anthropic74.731200K$1.00$5.00
44Qwen: Qwen3.5-27Bqwen74.142.1262K$0.1950$1.56
45DeepSeek: DeepSeek V3.2deepseek74.132.1131K$0.2520$0.3780
46Qwen: Qwen3.5-122B-A10Bqwen73.841.6262K$0.2600$2.08
47Xiaomi: MiMo-V2-Omnixiaomi73.843.4262K$0.4000$2.00
48Z.ai: GLM 5 Turboz-ai73.746.8203K$1.20$4.00
49xAI: Grok 4x-ai73.341.5256K$3.00$15.00
50MiniMax: MiniMax M2.1minimax7339.4205K$0.2900$0.9500
#1Top Pick95.3
Google: Gemini 3.1 Pro Previewgoogle
AI 57.21M ctx$2.00/M in
#2In-House Pick94
Anthropic: Claude Opus 4.7anthropic
AI 57.31M ctx$5.00/M in
#3NewTop Pick93.6
OpenAI: GPT-5.5openai
AI 60.21.1M ctx$5.00/M in
#493.4
Anthropic: Claude Opus 4.6anthropic
AI 46.51M ctx$5.00/M in
#589.8
OpenAI: GPT-5.3-Codexopenai
AI 53.6400K ctx$1.75/M in
#687.1
OpenAI: GPT-5.4 Miniopenai
AI 48.9400K ctx$0.7500/M in
#7NewTop Pick86.7
GPT-5.5 (high)OpenAI
AI 58.9$5.00/M in
#886.4
MoonshotAI: Kimi K2.6moonshotai
AI 53.9262K ctx$0.7300/M in
#9NewBest for Coding85.7
GPT-5.5 (medium)OpenAI
AI 56.7$5.00/M in
#1084.9
Google: Gemini 3 Flash Previewgoogle
AI 46.41M ctx$0.5000/M in
#11In-House Pick84.9
Anthropic: Claude Sonnet 4.6anthropic
AI 44.41M ctx$3.00/M in
#12Best for Agents84.8
Qwen: Qwen3.6 Plusqwen
AI 501M ctx$0.3250/M in
#13New84.2
DeepSeek: DeepSeek V4 Prodeepseek
AI 49.81M ctx$0.4350/M in
#1482.7
xAI: Grok 4.20 Multi-Agent Betax-ai
AI 48.52M ctx$2.00/M in
#1582.5
Anthropic: Claude Opus 4.5anthropic
AI 49.7200K ctx$5.00/M in
#1682.5
OpenAI: GPT-5.2-Codexopenai
AI 49400K ctx$1.75/M in
#17New82.5
Xiaomi: MiMo-V2.5xiaomi
AI 491M ctx$0.4000/M in
#18New82.2
DeepSeek: DeepSeek V4 Flashdeepseek
AI 461M ctx$0.1120/M in
#19New82.1
Qwen: Qwen3.6 Max Previewqwen
AI 51.8262K ctx$1.04/M in
#2082.1
Anthropic: Claude Sonnet 4.5anthropic
AI 431M ctx$3.00/M in
#21New82
GPT-5.5 (low)OpenAI
AI 50.8$5.00/M in
#2282
Z.ai: GLM 5.1z-ai
AI 43.8203K ctxFree/M in
#2382
Muse SparkMeta
AI 52.2Free/M in
#2482
OpenAI: GPT-5.4 Nanoopenai
AI 38.1400K ctx$0.2000/M in
#2581.5
Xiaomi: MiMo-V2-Proxiaomi
AI 49.21M ctx$1.00/M in
#2681.1
OpenAI: GPT-5.2openai
AI 51.3400K ctx$1.75/M in
#2780.2
Z.ai: GLM 5z-ai
AI 49.8203K ctx$0.6000/M in
#2879.6
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
AI 48.466K ctx$2.00/M in
#2979.6
MiniMax: MiniMax M2.7minimax
AI 49.6205K ctx$0.2790/M in
#3079.1
OpenAI: GPT-5 Codexopenai
AI 44.6400K ctx$1.25/M in
#3177.2
Kwaipilot: KAT-Coder-Pro V2kwaipilot
AI 43.8256K ctx$0.3000/M in
#3277.1
Google: Gemini 2.5 Pro Preview 06-05google
AI 30.31M ctx$1.25/M in
#3376.9
Anthropic: Claude Opus 4.1anthropic
AI 42200K ctx$15.00/M in
#3476.8
Google: Gemini 3 Pro Previewgoogle
AI 41.31M ctx$2.00/M in
#35New76.8
Qwen: Qwen3.6 35B A3Bqwen
AI 43.5262K ctx$0.1490/M in
#36New76.5
Qwen: Qwen3.6 27Bqwen
AI 45.8262K ctx$0.3200/M in
#3776.5
OpenAI: GPT-5.1-Codexopenai
AI 43.1400K ctx$1.25/M in
#3876.3
OpenAI: GPT-5 Miniopenai
AI 41.2400K ctx$0.2500/M in
#3975.7
OpenAI: o3openai
AI 38.4200K ctx$2.00/M in
#4075.5
Grok 4.20 0309 (Reasoning)xAI
AI 48.5$2.00/M in
#4175.3
MiniMax: MiniMax M2.5minimax
AI 41.9205K ctx$0.1500/M in
#4274.9
Anthropic: Claude Opus 4anthropic
AI 39200K ctx$15.00/M in
#4374.7
Anthropic: Claude Haiku 4.5anthropic
AI 31200K ctx$1.00/M in
#4474.1
Qwen: Qwen3.5-27Bqwen
AI 42.1262K ctx$0.1950/M in
#4574.1
DeepSeek: DeepSeek V3.2deepseek
AI 32.1131K ctx$0.2520/M in
#4673.8
Qwen: Qwen3.5-122B-A10Bqwen
AI 41.6262K ctx$0.2600/M in
#4773.8
Xiaomi: MiMo-V2-Omnixiaomi
AI 43.4262K ctx$0.4000/M in
#4873.7
Z.ai: GLM 5 Turboz-ai
AI 46.8203K ctx$1.20/M in
#4973.3
xAI: Grok 4x-ai
AI 41.5256K ctx$3.00/M in
#5073
MiniMax: MiniMax M2.1minimax
AI 39.4205K ctx$0.2900/M in