Tool Use

Models with strong tool-use and function-calling support.

Updated June 12, 2026

Models with strong tool-use and function-calling support.

# Model Score AI Index Context Input / 1M Output / 1M Caps
1Anthropic: Claude Fable 5anthropic New Top Pick94.764.91M$10.00$50.00
2Anthropic: Claude Opus 4.8anthropic New Top Pick In-House Pick92.461.41M$5.00$25.00
3Google: Gemini 3.1 Pro Previewgoogle Best for Agents91.757.21M$2.00$12.00
4OpenAI: GPT-5.5openai Top Pick88.860.21.1M$5.00$30.00
5Anthropic: Claude Opus 4.7anthropic88.357.31M$5.00$25.00
6Anthropic: Claude Sonnet 4.6anthropic In-House Pick84.444.41M$3.00$15.00
7Qwen: Qwen3.7 Maxqwen New83.956.61M$1.25$3.75
8OpenAI: GPT-5.3-Codexopenai83.553.6400K$1.75$14.00
9Google: Gemini 3 Flash Previewgoogle82.5351M$0.5000$3.00
10Z.ai: GLM 4.6z-ai8232.5203K$0.4300$1.74
11OpenAI: GPT-4.1openai8226.31M$2.00$8.00
12Mistral Large 2407mistralai8213131K$2.00$6.00
13OpenAI: GPT-5.4openai8235.41.1M$2.50$15.00
14Tencent: Hy3 preview (free)tencent8241.9262KFreeFree
15MiniMax: MiniMax M1minimax8224.41M$0.4000$2.20
16Qwen: QwQ 32Bqwen8219.7131K$0.1500$0.5800
17Meta: Llama 3.1 70B Instructmeta-llama8212.5131K$0.4000$0.4000
18NVIDIA: Nemotron 3 Supernvidia82361M$0.0900$0.4500
19Qwen: Qwen3.6 27Bqwen Updated8245.8262K$0.2885$3.17
20MiniMax: MiniMax M2.1minimax8239.4205K$0.2900$0.9500
21MiniMax: MiniMax M2minimax8236.1205K$0.2550$1.00
22DeepSeek: R1deepseek8218.8164K$0.7000$2.50
23Mistral Largemistralai829.9128K$2.00$6.00
24Google: Gemma 4 26B A4Bgoogle8227.1262K$0.0600$0.3300
25Z.ai: GLM 4.6Vz-ai8223.4131K$0.3000$0.9000
26Anthropic: Claude Sonnet 4.5anthropic82431M$3.00$15.00
27OpenAI: GPT-4.1 Miniopenai8222.91M$0.4000$1.60
28Mistral: Pixtral Large 2411mistralai8214131K$2.00$6.00
29Inception: Mercury 2inception8232.8128K$0.2500$0.7500
30inclusionAI: Ling-2.6-1T (free)inclusionai8233.6262KFreeFree
31Tencent: Hy3 previewtencent8241.9262K$0.0630$0.2100
32DeepSeek: DeepSeek V3.2deepseek8232.1131K$0.2288$0.3432
33NVIDIA: Nemotron Nano 9B V2nvidia8213.2131K$0.0400$0.1600
34Google: Gemini 2.5 Flashgoogle8220.61M$0.3000$2.50
35Google: Gemini 2.0 Flash Litegoogle8218.51M$0.0750$0.3000
36Xiaomi: MiMo-V2-Omnixiaomi8243.4262K$0.4000$2.00
37Qwen: Qwen3.6 Max Previewqwen8251.8262K$1.04$6.24
38Z.ai: GLM 4.7z-ai8234.2203K$0.4000$1.75
39Qwen: Qwen3 VL 32B Instructqwen8217.2262K$0.1040$0.4160
40OpenAI: gpt-oss-20bopenai8220.8131K$0.0290$0.1400
41Qwen: Qwen3 4B (free)qwen8212.541KFreeFree
42DeepSeek: DeepSeek V3deepseek8216.5131K$0.2002$0.8001
43Z.ai: GLM 5.1z-ai8243.8203K$0.9800$3.08
44Nex AGI: DeepSeek V3.1 Nex N1nex-agi8228.1131K$0.1350$0.5000
45DeepSeek: DeepSeek V3.2 Expdeepseek8228.4164K$0.2700$0.4100
46Google: Gemini 2.5 Flash Litegoogle8212.71M$0.1000$0.4000
47OpenAI: GPT-4.1 Nanoopenai82131M$0.1000$0.4000
48Anthropic: Claude 3.5 Haikuanthropic8218.7200K$0.8000$4.00
49Qwen: Qwen3.5-9Bqwen8232.4262K$0.1000$0.1500
50DeepSeek: DeepSeek V4 Prodeepseek8251.51M$0.4350$0.8700
#1NewTop Pick94.7
Anthropic: Claude Fable 5anthropic
AI 64.91M ctx$10.00/M in
#2NewTop PickIn-House Pick92.4
Anthropic: Claude Opus 4.8anthropic
AI 61.41M ctx$5.00/M in
#3Best for Agents91.7
Google: Gemini 3.1 Pro Previewgoogle
AI 57.21M ctx$2.00/M in
#4Top Pick88.8
OpenAI: GPT-5.5openai
AI 60.21.1M ctx$5.00/M in
#588.3
Anthropic: Claude Opus 4.7anthropic
AI 57.31M ctx$5.00/M in
#6In-House Pick84.4
Anthropic: Claude Sonnet 4.6anthropic
AI 44.41M ctx$3.00/M in
#7New83.9
Qwen: Qwen3.7 Maxqwen
AI 56.61M ctx$1.25/M in
#883.5
OpenAI: GPT-5.3-Codexopenai
AI 53.6400K ctx$1.75/M in
#982.5
Google: Gemini 3 Flash Previewgoogle
AI 351M ctx$0.5000/M in
#1082
Z.ai: GLM 4.6z-ai
AI 32.5203K ctx$0.4300/M in
#1182
OpenAI: GPT-4.1openai
AI 26.31M ctx$2.00/M in
#1282
Mistral Large 2407mistralai
AI 13131K ctx$2.00/M in
#1382
OpenAI: GPT-5.4openai
AI 35.41.1M ctx$2.50/M in
#1482
Tencent: Hy3 preview (free)tencent
AI 41.9262K ctxFree/M in
#1582
MiniMax: MiniMax M1minimax
AI 24.41M ctx$0.4000/M in
#1682
Qwen: QwQ 32Bqwen
AI 19.7131K ctx$0.1500/M in
#1782
Meta: Llama 3.1 70B Instructmeta-llama
AI 12.5131K ctx$0.4000/M in
#1882
NVIDIA: Nemotron 3 Supernvidia
AI 361M ctx$0.0900/M in
#1982
Qwen: Qwen3.6 27Bqwen
AI 45.8262K ctx$0.2885/M in
#2082
MiniMax: MiniMax M2.1minimax
AI 39.4205K ctx$0.2900/M in
#2182
MiniMax: MiniMax M2minimax
AI 36.1205K ctx$0.2550/M in
#2282
DeepSeek: R1deepseek
AI 18.8164K ctx$0.7000/M in
#2382
Mistral Largemistralai
AI 9.9128K ctx$2.00/M in
#2482
Google: Gemma 4 26B A4Bgoogle
AI 27.1262K ctx$0.0600/M in
#2582
Z.ai: GLM 4.6Vz-ai
AI 23.4131K ctx$0.3000/M in
#2682
Anthropic: Claude Sonnet 4.5anthropic
AI 431M ctx$3.00/M in
#2782
OpenAI: GPT-4.1 Miniopenai
AI 22.91M ctx$0.4000/M in
#2882
Mistral: Pixtral Large 2411mistralai
AI 14131K ctx$2.00/M in
#2982
Inception: Mercury 2inception
AI 32.8128K ctx$0.2500/M in
#3082
inclusionAI: Ling-2.6-1T (free)inclusionai
AI 33.6262K ctxFree/M in
#3182
Tencent: Hy3 previewtencent
AI 41.9262K ctx$0.0630/M in
#3282
DeepSeek: DeepSeek V3.2deepseek
AI 32.1131K ctx$0.2288/M in
#3382
NVIDIA: Nemotron Nano 9B V2nvidia
AI 13.2131K ctx$0.0400/M in
#3482
Google: Gemini 2.5 Flashgoogle
AI 20.61M ctx$0.3000/M in
#3582
Google: Gemini 2.0 Flash Litegoogle
AI 18.51M ctx$0.0750/M in
#3682
Xiaomi: MiMo-V2-Omnixiaomi
AI 43.4262K ctx$0.4000/M in
#3782
Qwen: Qwen3.6 Max Previewqwen
AI 51.8262K ctx$1.04/M in
#3882
Z.ai: GLM 4.7z-ai
AI 34.2203K ctx$0.4000/M in
#3982
Qwen: Qwen3 VL 32B Instructqwen
AI 17.2262K ctx$0.1040/M in
#4082
OpenAI: gpt-oss-20bopenai
AI 20.8131K ctx$0.0290/M in
#4182
Qwen: Qwen3 4B (free)qwen
AI 12.541K ctxFree/M in
#4282
DeepSeek: DeepSeek V3deepseek
AI 16.5131K ctx$0.2002/M in
#4382
Z.ai: GLM 5.1z-ai
AI 43.8203K ctx$0.9800/M in
#4482
Nex AGI: DeepSeek V3.1 Nex N1nex-agi
AI 28.1131K ctx$0.1350/M in
#4582
DeepSeek: DeepSeek V3.2 Expdeepseek
AI 28.4164K ctx$0.2700/M in
#4682
Google: Gemini 2.5 Flash Litegoogle
AI 12.71M ctx$0.1000/M in
#4782
OpenAI: GPT-4.1 Nanoopenai
AI 131M ctx$0.1000/M in
#4882
Anthropic: Claude 3.5 Haikuanthropic
AI 18.7200K ctx$0.8000/M in
#4982
Qwen: Qwen3.5-9Bqwen
AI 32.4262K ctx$0.1000/M in
#5082
DeepSeek: DeepSeek V4 Prodeepseek
AI 51.51M ctx$0.4350/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 578 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.