Tool Use

Models with strong tool-use and function-calling support.

Updated July 2, 2026

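| # | Model | Provider | Tags | Score | AI Index | Context | Input / 1M | Output / 1M |
|---|-------|----------|------|-------|----------|---------|------------|-------------|
| 1 | Anthropic: Claude Fable 5 | anthropic | New, Top Pick | 93.3 | 59.9 | 1M | $10.00 | $50.00 |
| 2 | Anthropic: Claude Sonnet 5 | anthropic | New, In-House Pick | 93 | 53.4 | 1M | $2.00 | $10.00 |
| 3 | Anthropic: Claude Opus 4.8 | anthropic | Top Pick, In-House Pick | 92.3 | 55.7 | 1M | $5.00 | $25.00 |
| 4 | Google: Gemini 3.1 Pro Preview | google | | 89.8 | 46.5 | 1M | $2.00 | $12.00 |
| 5 | Anthropic: Claude Opus 4.7 | anthropic | | 89.4 | 53.5 | 1M | $5.00 | $25.00 |
| 6 | Google: Gemini 3.5 Flash | google | | 88.9 | 50.2 | 1M | $1.50 | $9.00 |
| 7 | OpenAI: GPT-5.5 | openai | Top Pick | 88.3 | 54.8 | 1.1M | $5.00 | $30.00 |
| 8 | OpenAI: GPT-5.4 | openai | | 87 | 51.4 | 1.1M | $2.50 | $15.00 |
| 9 | Z.ai: GLM 5.2 | z-ai | New | 86.1 | 51.1 | 1M | $0.9300 | $3.00 |
| 10 | Anthropic: Claude Sonnet 4.6 | anthropic | | 85.1 | 47.2 | 1M | $3.00 | $15.00 |
| 11 | Qwen: Qwen3.7 Max | qwen | | 83.5 | 46 | 1M | $1.25 | $3.75 |
| 12 | Anthropic: Claude Opus 4.6 | anthropic | | 83 | 43.7 | 1M | $5.00 | $25.00 |
| 13 | DeepSeek: DeepSeek V4 Pro | deepseek | | 82.1 | 44.3 | 1M | $0.4350 | $0.8700 |
| 14 | Nex AGI: DeepSeek V3.1 Nex N1 | nex-agi | | 82 | 21 | 131K | $0.1350 | $0.5000 |
| 15 | Z.ai: GLM 4.6 | z-ai | | 82 | 25.1 | 203K | $0.4300 | $1.74 |
| 16 | Qwen: Qwen3 Coder 30B A3B Instruct | qwen | | 82 | 13.6 | 160K | $0.0700 | $0.2700 |
| 17 | Qwen: Qwen3 32B | qwen | | 82 | 11.5 | 131K | $0.0800 | $0.2800 |
| 18 | Mistral: Mixtral 8x7B Instruct | mistralai | | 82 | 2.4 | 33K | $0.5400 | $0.5400 |
| 19 | Qwen: Qwen3.6 Plus | qwen | | 82 | 39.6 | 1M | $0.3250 | $1.95 |
| 20 | NVIDIA: Nemotron 3 Ultra | nvidia | New | 82 | 37.8 | 1M | $0.5000 | $2.20 |
| 21 | Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | google | | 82 | 39.6 | 66K | $2.00 | $12.00 |
| 22 | Qwen: Qwen3 Next 80B A3B Instruct (free) | qwen | | 82 | 20.1 | 262K | Free | Free |
| 23 | Mistral: Devstral Small 1.1 | mistralai | | 82 | 9.3 | 131K | $0.1000 | $0.3000 |
| 24 | Meta: Llama 4 Scout | meta-llama | | 82 | 10 | 10M | $0.1000 | $0.3000 |
| 25 | xAI: Grok 4.20 Multi-Agent Beta | x-ai | | 82 | 48.5 | 2M | $2.00 | $6.00 |
| 26 | GPT-5.5 (Non-reasoning) | OpenAI | | 82 | 35.4 | | $5.00 | $30.00 |
| 27 | Google: Gemini 3 Flash Preview | google | | 82 | 37.8 | 1M | $0.5000 | $3.00 |
| 28 | MiniMax: MiniMax M2 | minimax | | 82 | 28.3 | 205K | $0.2550 | $1.02 |
| 29 | Z.ai: GLM 4.5V | z-ai | | 82 | 7 | 66K | $0.6000 | $1.80 |
| 30 | Anthropic: Claude Opus 4 | anthropic | | 82 | 31 | 200K | $15.00 | $75.00 |
| 31 | Google: Gemini 2.0 Flash | google | | 82 | 10.7 | 1M | $0.1000 | $0.4000 |
| 32 | Kwaipilot: KAT-Coder-Pro V2 | kwaipilot | | 82 | 35.4 | 256K | $0.3000 | $1.20 |
| 33 | Qwen: Qwen3.6 Max Preview | qwen | | 82 | 40 | 262K | $1.04 | $6.24 |
| 34 | Anthropic: Claude Sonnet 4.5 | anthropic | | 82 | 36.4 | 1M | $3.00 | $15.00 |
| 35 | Qwen: Qwen3 30B A3B Instruct 2507 | qwen | | 82 | 9.1 | 131K | $0.0482 | $0.1931 |
| 36 | Qwen: Qwen3 235B A22B | qwen | | 82 | 13.4 | 131K | $0.4550 | $1.82 |
| 37 | Amazon: Nova Lite 1.0 | amazon | | 82 | 6.9 | 300K | $0.0600 | $0.2400 |
| 38 | MoonshotAI: Kimi K2.5 | moonshotai | | 82 | 29.4 | 262K | $0.3750 | $2.03 |
| 39 | xAI: Grok 4.1 Fast | x-ai | | 82 | 30.6 | 2M | $0.2000 | $0.5000 |
| 40 | Qwen: Qwen3 Next 80B A3B Instruct | qwen | | 82 | 13.7 | 262K | $0.0900 | $1.10 |
| 41 | xAI: Grok 4 | x-ai | | 82 | 33.3 | 256K | $3.00 | $15.00 |
| 42 | DeepSeek: DeepSeek V3 0324 | deepseek | | 82 | 15.4 | 164K | $0.2400 | $0.9000 |
| 43 | NVIDIA: Llama 3.1 Nemotron 70B Instruct | nvidia | | 82 | 7.6 | 131K | $1.20 | $1.20 |
| 44 | GPT-5.5 (medium) | OpenAI | | 82 | 50.4 | | $5.00 | $30.00 |
| 45 | xAI: Grok 4.3 | x-ai | | 82 | 37.6 | 1M | $1.25 | $2.50 |
| 46 | Xiaomi: MiMo-V2-Flash | xiaomi | | 82 | 24.7 | 262K | $0.1000 | $0.3000 |
| 47 | Qwen: Qwen3 VL 32B Instruct | qwen | | 82 | 11.1 | 262K | $0.1040 | $0.4160 |
| 48 | OpenAI: GPT-5 | openai | | 82 | 17.2 | 400K | $1.25 | $10.00 |
| 49 | Anthropic: Claude Sonnet 4 | anthropic | | 82 | 28.9 | 1M | $3.00 | $15.00 |
| 50 | OpenAI: GPT-4o (2024-05-13) | openai | | 82 | 8.6 | 128K | $5.00 | $15.00 |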

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).
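To make the "capability per dollar" idea concrete, here is a minimal Python sketch that divides a model's overall score by a blended token price. The leaderboard does not publish how it computes value, so the `score_per_dollar` function, the 75/25 input-to-output blend, and the handling of free models are all assumptions chosen for illustration. The scores and prices come from the table on this page.

```python
def score_per_dollar(score: float, input_price: float, output_price: float,
                     input_share: float = 0.75) -> float:
    """Score points per blended dollar per 1M tokens.

    Illustrative only: the blend ratio is an assumption, not the
    leaderboard's published method.
    """
    blended = input_share * input_price + (1 - input_share) * output_price
    if blended <= 0:
        # Free models have no finite ratio under this definition.
        return float("inf")
    return score / blended


# (model, overall score, input $/1M, output $/1M) taken from the table.
rows = [
    ("Anthropic: Claude Fable 5", 93.3, 10.00, 50.00),
    ("Anthropic: Claude Sonnet 5", 93.0, 2.00, 10.00),
    ("DeepSeek: DeepSeek V4 Pro", 82.1, 0.4350, 0.8700),
]

# Sort from best to worst value under the assumed blend.
ranked = sorted(rows, key=lambda r: score_per_dollar(r[1], r[2], r[3]), reverse=True)
for name, score, price_in, price_out in ranked:
    print(f"{name}: {score_per_dollar(score, price_in, price_out):.1f} points per blended dollar")
```

Under these assumptions the cheapest of the three models ranks first on value despite the lowest overall score, which is why value can only be one of several weighted dimensions.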

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.
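As a rough illustration of that calculation, the following Python sketch combines four 0–10 grades into a 0–100 score with a weighted average. The actual weights are not published here, so the values in `WEIGHTS`, the dimension keys, and the `overall_score` helper are hypothetical.

```python
from typing import Mapping

# Hypothetical weights (they sum to 100); the real weighting is not published.
WEIGHTS = {
    "intelligence": 35,
    "technical": 30,
    "content": 20,
    "value": 15,
}


def overall_score(grades: Mapping[str, float],
                  weights: Mapping[str, float] = WEIGHTS) -> float:
    """Combine 0-10 dimension grades into a 0-100 weighted score."""
    if set(grades) != set(weights):
        raise ValueError("grades and weights must cover the same dimensions")
    for name, grade in grades.items():
        if not 0 <= grade <= 10:
            raise ValueError(f"{name} grade {grade} is outside 0-10")
    total_weight = sum(weights.values())
    weighted = sum(grades[name] * weights[name] for name in weights)
    # Weighted average on the 0-10 scale, multiplied by 10 to reach 0-100.
    return round(weighted * 10 / total_weight, 1)


example = {"intelligence": 9, "technical": 8, "content": 7, "value": 6}
print(overall_score(example))
```

Dividing by the total weight keeps the result on the 0–100 scale even if the weights are later changed to values that do not sum to 100.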

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.