Tool Use

Models with strong tool-use and function-calling support.

Updated June 12, 2026


| # | Model | Provider | Tags | Score | AI Index | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|---|---|
| 1 | Anthropic: Claude Fable 5 | anthropic | New, Top Pick | 94.7 | 64.9 | 1M | $10.00 | $50.00 |
| 2 | Anthropic: Claude Opus 4.8 | anthropic | New, Top Pick, In-House Pick | 92.4 | 61.4 | 1M | $5.00 | $25.00 |
| 3 | Google: Gemini 3.1 Pro Preview | google | Best for Agents | 91.7 | 57.2 | 1M | $2.00 | $12.00 |
| 4 | OpenAI: GPT-5.5 | openai | Top Pick | 88.8 | 60.2 | 1.1M | $5.00 | $30.00 |
| 5 | Anthropic: Claude Opus 4.7 | anthropic | | 88.3 | 57.3 | 1M | $5.00 | $25.00 |
| 6 | Anthropic: Claude Sonnet 4.6 | anthropic | In-House Pick | 84.4 | 44.4 | 1M | $3.00 | $15.00 |
| 7 | Qwen: Qwen3.7 Max | qwen | New | 83.9 | 56.6 | 1M | $1.25 | $3.75 |
| 8 | OpenAI: GPT-5.3-Codex | openai | | 83.5 | 53.6 | 400K | $1.75 | $14.00 |
| 9 | Google: Gemini 3 Flash Preview | google | | 82.5 | 35 | 1M | $0.5000 | $3.00 |
| 10 | Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | google | | 82 | 48.4 | 66K | $2.00 | $12.00 |
| 11 | xAI: Grok Code Fast 1 | x-ai | | 82 | 28.7 | 256K | $0.2000 | $1.50 |
| 12 | OpenAI: o3 Pro | openai | | 82 | 40.7 | 200K | $20.00 | $80.00 |
| 13 | Anthropic: Claude 3.7 Sonnet (thinking) | anthropic | | 82 | 34.7 | 200K | $3.00 | $15.00 |
| 14 | OpenAI: GPT-4o-mini | openai | | 82 | 12.6 | 128K | $0.1500 | $0.6000 |
| 15 | Qwen3 Coder 480B A35B Instruct | Alibaba | | 82 | 24.8 | | $1.50 | $7.50 |
| 16 | MiniMax: MiniMax M3 | minimax | New | 82 | 54.7 | 1M | $0.3000 | $1.20 |
| 17 | Xiaomi: MiMo-V2-Flash | xiaomi | | 82 | 30.3 | 262K | $0.1000 | $0.3000 |
| 18 | Qwen: Qwen3 VL 8B Thinking | qwen | | 82 | 16.7 | 256K | $0.1170 | $1.37 |
| 19 | Qwen: Qwen3 30B A3B Instruct 2507 | qwen | | 82 | 15 | 131K | $0.0482 | $0.1931 |
| 20 | Qwen: Qwen3 8B | qwen | | 82 | 10.6 | 131K | $0.0500 | $0.4000 |
| 21 | Meta: Llama 3.3 70B Instruct (free) | meta-llama | | 82 | 14.5 | 131K | Free | Free |
| 22 | Mistral: Mixtral 8x7B Instruct | mistralai | | 82 | 7.7 | 33K | $0.5400 | $0.5400 |
| 23 | Qwen: Qwen3.6 Plus | qwen | | 82 | 50 | 1M | $0.3250 | $1.95 |
| 24 | inclusionAI: Ling-2.6-flash | inclusionai | | 82 | 26.2 | 262K | $0.0100 | $0.0300 |
| 25 | Qwen: Qwen3 VL 235B A22B Thinking | qwen | | 82 | 27.6 | 131K | $0.2600 | $2.60 |
| 26 | MoonshotAI: Kimi K2 0711 | moonshotai | | 82 | 26.3 | 131K | $0.5700 | $2.30 |
| 27 | xAI: Grok 4.20 Multi-Agent Beta | x-ai | | 82 | 48.5 | 2M | $2.00 | $6.00 |
| 28 | GPT-5.5 (Non-reasoning) | OpenAI | | 82 | 40.9 | | $5.00 | $30.00 |
| 29 | MoonshotAI: Kimi K2.5 | moonshotai | | 82 | 37.3 | 262K | $0.4000 | $1.90 |
| 30 | xAI: Grok 4.1 Fast | x-ai | | 82 | 23.6 | 2M | $0.2000 | $0.5000 |
| 31 | OpenAI: GPT-4o Audio | openai | | 82 | 12.8 | 128K | $2.50 | $10.00 |
| 32 | xAI: Grok 3 Mini | x-ai | | 82 | 32.1 | 131K | $0.3000 | $0.5000 |
| 33 | OpenAI: o3 Mini High | openai | | 82 | 25.2 | 200K | $1.10 | $4.40 |
| 34 | Kwaipilot: KAT-Coder-Pro V2 | kwaipilot | | 82 | 43.8 | 256K | $0.3000 | $1.20 |
| 35 | Qwen: Qwen3.7 Plus | qwen | New | 82 | 53.3 | 1M | $0.3200 | $1.28 |
| 36 | Qwen: Qwen3 VL 8B Instruct | qwen | | 82 | 14.3 | 256K | $0.0800 | $0.5000 |
| 37 | Z.ai: GLM 4.5 | z-ai | | 82 | 26.4 | 131K | $0.6000 | $2.20 |
| 38 | Qwen: Qwen3 14B | qwen | | 82 | 16.2 | 132K | $0.1000 | $0.2400 |
| 39 | IBM: Granite 4.1 8B | ibm-granite | | 82 | 12.4 | 131K | $0.0500 | $0.1000 |
| 40 | Qwen: Qwen3 VL 235B A22B Instruct | qwen | | 82 | 20.8 | 262K | $0.2000 | $0.8800 |
| 41 | Mistral: Devstral Medium | mistralai | | 82 | 18.7 | 131K | $0.4000 | $2.00 |
| 42 | Meta: Llama 4 Maverick | meta-llama | | 82 | 18.4 | 1M | $0.1500 | $0.6000 |
| 43 | NVIDIA: Llama 3.1 Nemotron 70B Instruct | nvidia | | 82 | 13.4 | 131K | $1.20 | $1.20 |
| 44 | GPT-5.5 (medium) | OpenAI | Best for Agents | 82 | 56.7 | | $5.00 | $30.00 |
| 45 | inclusionAI: Ring-2.6-1T | inclusionai | | 82 | 38.5 | 262K | $0.0750 | $0.6250 |
| 46 | Google: Gemini 3 Pro Preview | google | | 82 | 41.3 | 1M | $2.00 | $12.00 |
| 47 | Mistral: Mistral Medium 3.1 | mistralai | | 82 | 21.3 | 131K | $0.4000 | $2.00 |
| 48 | Google: Gemini 2.5 Pro Preview 06-05 | google | | 82 | 30.3 | 1M | $1.25 | $10.00 |
| 49 | Google: Gemini 2.0 Flash | google | | 82 | 16.8 | 1M | $0.1000 | $0.4000 |
| 50 | OpenAI: GPT-4o (2024-05-13) | openai | | 82 | 14.5 | 128K | $5.00 | $15.00 |
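
The Input / 1M and Output / 1M columns are list prices in US dollars per million tokens. As a rough illustration of how they translate into the cost of a single request, here is a minimal Python sketch. The function name `request_cost_usd` and the token counts are hypothetical, chosen only for the example; the prices are those listed above for Claude Opus 4.8.

```python
def request_cost_usd(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Estimate the cost of one request from per-1M-token list prices."""
    input_cost = (input_tokens / 1_000_000) * input_per_1m
    output_cost = (output_tokens / 1_000_000) * output_per_1m
    return input_cost + output_cost


# Listed prices for Claude Opus 4.8: $5.00 input and $25.00 output per 1M tokens.
# The token counts are made up for illustration (a long agent transcript).
cost = request_cost_usd(200_000, 50_000, 5.00, 25.00)
print(f"${cost:.2f}")  # $2.25
```

Tool-use workloads tend to resend tool definitions and earlier results on every turn, so input tokens often dominate the bill even when the output price is higher.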

How we rank AI models

The Design for Online AI Model Leaderboard scores 578 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.
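
To make the arithmetic concrete, here is a minimal Python sketch of a weighted combination of this kind. The exact weights are not stated here, so the values in `ASSUMED_WEIGHTS` are placeholders, and the function name and example grades are likewise hypothetical.

```python
# Placeholder weights: assumptions for illustration, not the leaderboard's
# actual weighting, which is not published in this section.
ASSUMED_WEIGHTS = {
    "intelligence": 0.35,
    "technical": 0.30,
    "content": 0.20,
    "value": 0.15,
}


def overall_score(grades, weights=ASSUMED_WEIGHTS):
    """Combine 0-10 dimension grades into a single 0-100 overall score."""
    if set(grades) != set(weights):
        raise ValueError("grades and weights must cover the same dimensions")
    for name, grade in grades.items():
        if not 0 <= grade <= 10:
            raise ValueError(f"{name} grade must be between 0 and 10")
    # Weighted mean on the 0-10 scale, normalised in case weights do not sum to 1.
    total_weight = sum(weights.values())
    weighted_mean = sum(grades[d] * weights[d] for d in weights) / total_weight
    # Rescale from 0-10 to 0-100.
    return round(weighted_mean * 10, 1)


# Made-up grades for a hypothetical model.
example = {"intelligence": 9.0, "technical": 9.5, "content": 8.5, "value": 6.0}
print(overall_score(example))  # 86.0 with the placeholder weights above
```

With any weighting of this shape, a model that is strong on benchmarks but expensive loses points only through the value dimension, so the size of that weight decides how far price can move a model down the table.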

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.