AI Agents
Models optimised for autonomous agent workflows.
| # | Model | Score | AI Index | Context | Input / 1M | Output / 1M | Caps | |
|---|---|---|---|---|---|---|---|---|
| 1 | Google: Gemini 3.1 Pro Previewgoogle New Top Pick | 97 | 57.2 | 1M | $2.00 | $12.00 | TV | |
| 2 | OpenAI: GPT-5.4openai New Top Pick | 95 | 57 | 1.1M | $2.50 | $15.00 | TV | |
| 3 | Anthropic: Claude Opus 4.6anthropic Best for Coding | 93 | 53 | 1M | $5.00 | $25.00 | TV | |
| 4 | OpenAI: GPT-5.3-Codexopenai New Top Pick | 92 | 54 | 400K | $1.75 | $14.00 | TV | |
| 5 | Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google | 87 | 48.4 | 66K | $2.00 | $12.00 | V | |
| 6 | OpenAI: GPT-5.1openai | 87 | 47.7 | 400K | $1.25 | $10.00 | TV | |
| 7 | OpenAI: GPT-5.2-Codexopenai | 87 | 49 | 400K | $1.75 | $14.00 | TV | |
| 8 | Google: Gemini 3 Flash Previewgoogle | 87 | 35 | 1M | $0.5000 | $3.00 | TV | |
| 9 | Qwen: Qwen3.5 397B A17Bqwen New | 86 | 45 | 262K | $0.3900 | $2.34 | TV | |
| 10 | OpenAI: GPT-5 Codexopenai | 82 | 44.6 | 400K | $1.25 | $10.00 | TV | |
| 11 | Anthropic: Claude Sonnet 4.6anthropic New | 82 | 51.7 | 1M | $3.00 | $15.00 | TV | |
| 12 | Google: Gemini 3 Pro Previewgoogle | 80 | 41.3 | 1M | $2.00 | $12.00 | TV | |
| 13 | OpenAI: GPT-5.1-Codexopenai | 80 | 43.1 | 400K | $1.25 | $10.00 | TV | |
| 14 | OpenAI: GPT-5 Miniopenai | 80 | 41.2 | 400K | $0.2500 | $2.00 | TV | |
| 15 | xAI: Grok 4x-ai | 80 | 41.5 | 256K | $3.00 | $15.00 | TV | |
| 16 | Anthropic: Claude Opus 4.5anthropic | 80 | 43.1 | 200K | $5.00 | $25.00 | TV | |
| 17 | MiniMax: MiniMax M2.5minimax | 80 | 41.9 | 197K | $0.2500 | $1.20 | T | |
| 18 | Qwen: Qwen3.5-27Bqwen New | 79 | 37.2 | 262K | $0.1950 | $1.56 | TV | |
| 19 | MoonshotAI: Kimi K2 Thinkingmoonshotai | 79 | 40.9 | 131K | $0.4700 | $2.00 | T | |
| 20 | Z.ai: GLM 5z-ai Best for Agents | 78 | 49.8 | 203K | $0.7200 | $2.30 | T | |
| 21 | MiniMax: MiniMax M2.1minimax | 78 | 39.4 | 197K | $0.2700 | $0.9500 | T | |
| 22 | Z.ai: GLM 4.7z-ai | 78 | 42.1 | 203K | $0.3800 | $1.98 | T | |
| 23 | DeepSeek: DeepSeek V3.2deepseek | 78 | 41.7 | 164K | $0.2600 | $0.3800 | T | |
| 24 | Qwen: Qwen3.5-122B-A10Bqwen New | 78 | 35.9 | 262K | $0.2600 | $2.08 | TV | |
| 25 | Google: Gemini 2.5 Progoogle | 76 | 34.6 | 1M | $1.25 | $10.00 | TV | |
| 26 | Anthropic: Claude Sonnet 4.5anthropic | 74 | 43 | 1M | $3.00 | $15.00 | TV | |
| 27 | Anthropic: Claude Opus 4.1anthropic | 70 | 31.9 | 200K | $15.00 | $75.00 | TV | |
| 28 | Qwen: Qwen3.5-35B-A3Bqwen New | 70 | 37.1 | 262K | $0.1625 | $1.30 | TV | |
| 29 | StepFun: Step 3.5 Flashstepfun Best Value | 68 | 37.8 | 256K | $0.1000 | $0.3000 | T | |
| 30 | xAI: Grok 4.20 Multi-Agent Betax-ai New | 68 | 29.7 | 2M | $2.00 | $6.00 | TV | |
| 31 | OpenAI: GPT-5.1-Codex-Miniopenai | 68 | 38.6 | 400K | $0.2500 | $2.00 | TV | |
| 32 | MiniMax: MiniMax M2minimax | 68 | 36.1 | 197K | $0.2550 | $1.00 | T | |
| 33 | Qwen: Qwen3.5-9Bqwen New | 67 | 32.4 | 256K | $0.0500 | $0.1500 | TV | |
| 34 | OpenAI: gpt-oss-120bopenai | 67 | 24.5 | 131K | $0.0390 | $0.1900 | T | |
| 35 | Kwaipilot: KAT-Coder-Pro V1kwaipilot | 66 | 36 | 256K | $0.2070 | $0.8280 | T | |
| 36 | Xiaomi: MiMo-V2-Flashxiaomi | 66 | 30.4 | 262K | $0.0900 | $0.2900 | T | |
| 37 | MoonshotAI: Kimi K2 0905moonshotai | 65 | 30.9 | 131K | $0.4000 | $2.00 | T | |
| 38 | Z.ai: GLM 4.6z-ai | 65 | 32.5 | 205K | $0.3900 | $1.90 | T | |
| 39 | Qwen: Qwen3 Maxqwen | 65 | 31.4 | 262K | $1.20 | $6.00 | T | |
| 40 | MoonshotAI: Kimi K2 0711moonshotai | 63 | 26.3 | 131K | $0.5500 | $2.20 | T | |
| 41 | xAI: Grok Code Fast 1x-ai | 58 | 28.7 | 256K | $0.2000 | $1.50 | T |
#1NewTop Pick97
Google: Gemini 3.1 Pro Previewgoogle
Tool UseVision
#2NewTop Pick95
OpenAI: GPT-5.4openai
Tool UseVision
#3Best for Coding93
Anthropic: Claude Opus 4.6anthropic
Tool UseVision
#4NewTop Pick92
OpenAI: GPT-5.3-Codexopenai
Tool UseVision
#587
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
Vision
#687
OpenAI: GPT-5.1openai
Tool UseVision
#787
OpenAI: GPT-5.2-Codexopenai
Tool UseVision
#887
Google: Gemini 3 Flash Previewgoogle
Tool UseVision
#9New86
Qwen: Qwen3.5 397B A17Bqwen
Tool UseVision
#1082
OpenAI: GPT-5 Codexopenai
Tool UseVision
#11New82
Anthropic: Claude Sonnet 4.6anthropic
Tool UseVision
#1280
Google: Gemini 3 Pro Previewgoogle
Tool UseVision
#1380
OpenAI: GPT-5.1-Codexopenai
Tool UseVision
#1480
OpenAI: GPT-5 Miniopenai
Tool UseVision
#1580
xAI: Grok 4x-ai
Tool UseVision
#1680
Anthropic: Claude Opus 4.5anthropic
Tool UseVision
#1780
MiniMax: MiniMax M2.5minimax
Tool Use
#18New79
Qwen: Qwen3.5-27Bqwen
Tool UseVision
#1979
MoonshotAI: Kimi K2 Thinkingmoonshotai
Tool Use
#20Best for Agents78
Z.ai: GLM 5z-ai
Tool Use
#2178
MiniMax: MiniMax M2.1minimax
Tool Use
#2278
Z.ai: GLM 4.7z-ai
Tool Use
#2378
DeepSeek: DeepSeek V3.2deepseek
Tool Use
#24New78
Qwen: Qwen3.5-122B-A10Bqwen
Tool UseVision
#2576
Google: Gemini 2.5 Progoogle
Tool UseVision
#2674
Anthropic: Claude Sonnet 4.5anthropic
Tool UseVision
#2770
Anthropic: Claude Opus 4.1anthropic
Tool UseVision
#28New70
Qwen: Qwen3.5-35B-A3Bqwen
Tool UseVision
#29Best Value68
StepFun: Step 3.5 Flashstepfun
Tool Use
#30New68
xAI: Grok 4.20 Multi-Agent Betax-ai
Tool UseVision
#3168
OpenAI: GPT-5.1-Codex-Miniopenai
Tool UseVision
#3268
MiniMax: MiniMax M2minimax
Tool Use
#33New67
Qwen: Qwen3.5-9Bqwen
Tool UseVision
#3467
OpenAI: gpt-oss-120bopenai
Tool Use
#3566
Kwaipilot: KAT-Coder-Pro V1kwaipilot
Tool Use
#3666
Xiaomi: MiMo-V2-Flashxiaomi
Tool Use
#3765
MoonshotAI: Kimi K2 0905moonshotai
Tool Use
#3865
Z.ai: GLM 4.6z-ai
Tool Use
#3965
Qwen: Qwen3 Maxqwen
Tool Use
#4063
MoonshotAI: Kimi K2 0711moonshotai
Tool Use
#4158
xAI: Grok Code Fast 1x-ai
Tool Use