Tool Use

Models with strong tool-use and function-calling support.

Updated June 11, 2026

Models with strong tool-use and function-calling support.

# Model Score AI Index Context Input / 1M Output / 1M Caps
1Anthropic: Claude Fable 5anthropic New Top Pick94.764.91M$10.00$50.00
2Anthropic: Claude Opus 4.8anthropic New Top Pick In-House Pick92.461.41M$5.00$25.00
3Google: Gemini 3.1 Pro Previewgoogle Best for Agents91.757.21M$2.00$12.00
4OpenAI: GPT-5.5openai Top Pick88.860.21.1M$5.00$30.00
5Anthropic: Claude Opus 4.7anthropic88.357.31M$5.00$25.00
6Anthropic: Claude Sonnet 4.6anthropic In-House Pick84.451.71M$3.00$15.00
7Qwen: Qwen3.7 Maxqwen New83.956.61M$1.25$3.75
8OpenAI: GPT-5.3-Codexopenai83.553.6400K$1.75$14.00
9Google: Gemini 3 Flash Previewgoogle82.5351M$0.5000$3.00
10Anthropic: Claude Haiku 4.5anthropic8231200K$1.00$5.00
11Anthropic: Claude Opus 4.1anthropic8242200K$15.00$75.00
12Qwen: Qwen3 30B A3Bqwen8215.3131K$0.1200$0.5000
13OpenAI: o1openai8230.7200K$15.00$60.00
14Google: Gemini 2.5 Flash Lite Preview 09-2025google8219.41M$0.1000$0.4000
15Qwen: Qwen3 235B A22B Instruct 2507qwen8225262K$0.0900$0.1000
16Anthropic: Claude 3.5 Sonnetanthropic8215.9200K$6.00$30.00
17DeepSeek: DeepSeek V4 Flashdeepseek Best Value82461M$0.0983$0.1966
18Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google8248.466K$2.00$12.00
19xAI: Grok Code Fast 1x-ai8228.7256K$0.2000$1.50
20OpenAI: o3 Proopenai8240.7200K$20.00$80.00
21Anthropic: Claude 3.7 Sonnet (thinking)anthropic8234.7200K$3.00$15.00
22OpenAI: GPT-4o-miniopenai8212.6128K$0.1500$0.6000
23Qwen3 Coder 480B A35B InstructAlibaba8224.8$1.50$7.50
24MiniMax: MiniMax M3minimax New8254.71M$0.3000$1.20
25Xiaomi: MiMo-V2-Flashxiaomi8230.3262K$0.1000$0.3000
26Qwen: Qwen3 VL 8B Thinkingqwen8216.7256K$0.1170$1.37
27Qwen: Qwen3 30B A3B Instruct 2507qwen8215131K$0.0482$0.1931
28Qwen: Qwen3 8Bqwen8210.6131K$0.0500$0.4000
29Meta: Llama 3.3 70B Instruct (free)meta-llama8214.5131KFreeFree
30Mistral: Mixtral 8x7B Instructmistralai827.733K$0.5400$0.5400
31Qwen: Qwen3.6 Plusqwen82501M$0.3250$1.95
32inclusionAI: Ling-2.6-flashinclusionai8226.2262K$0.0100$0.0300
33Qwen: Qwen3 VL 235B A22B Thinkingqwen8227.6131K$0.2600$2.60
34MoonshotAI: Kimi K2 0711moonshotai8226.3131K$0.5700$2.30
35xAI: Grok 4.20 Multi-Agent Betax-ai8248.52M$2.00$6.00
36GPT-5.5 (Non-reasoning)OpenAI8240.9$5.00$30.00
37MoonshotAI: Kimi K2.5moonshotai8246.8262K$0.4000$1.90
38xAI: Grok 4.1 Fastx-ai8223.62M$0.2000$0.5000
39OpenAI: GPT-4o Audioopenai8212.8128K$2.50$10.00
40xAI: Grok 3 Minix-ai8232.1131K$0.3000$0.5000
41OpenAI: o3 Mini Highopenai8225.2200K$1.10$4.40
42Kwaipilot: KAT-Coder-Pro V2kwaipilot8243.8256K$0.3000$1.20
43Qwen: Qwen3.7 Plusqwen New8253.31M$0.3200$1.28
44Qwen: Qwen3 VL 8B Instructqwen8214.3256K$0.0800$0.5000
45Z.ai: GLM 4.5z-ai8226.4131K$0.6000$2.20
46Qwen: Qwen3 14Bqwen8216.2132K$0.1000$0.2400
47IBM: Granite 4.1 8Bibm-granite8212.4131K$0.0500$0.1000
48Qwen: Qwen3 VL 235B A22B Instructqwen8220.8262K$0.2000$0.8800
49Mistral: Devstral Mediummistralai8218.7131K$0.4000$2.00
50Meta: Llama 4 Maverickmeta-llama8218.41M$0.1500$0.6000
#1NewTop Pick94.7
Anthropic: Claude Fable 5anthropic
AI 64.91M ctx$10.00/M in
#2NewTop PickIn-House Pick92.4
Anthropic: Claude Opus 4.8anthropic
AI 61.41M ctx$5.00/M in
#3Best for Agents91.7
Google: Gemini 3.1 Pro Previewgoogle
AI 57.21M ctx$2.00/M in
#4Top Pick88.8
OpenAI: GPT-5.5openai
AI 60.21.1M ctx$5.00/M in
#588.3
Anthropic: Claude Opus 4.7anthropic
AI 57.31M ctx$5.00/M in
#6In-House Pick84.4
Anthropic: Claude Sonnet 4.6anthropic
AI 51.71M ctx$3.00/M in
#7New83.9
Qwen: Qwen3.7 Maxqwen
AI 56.61M ctx$1.25/M in
#883.5
OpenAI: GPT-5.3-Codexopenai
AI 53.6400K ctx$1.75/M in
#982.5
Google: Gemini 3 Flash Previewgoogle
AI 351M ctx$0.5000/M in
#1082
Anthropic: Claude Haiku 4.5anthropic
AI 31200K ctx$1.00/M in
#1182
Anthropic: Claude Opus 4.1anthropic
AI 42200K ctx$15.00/M in
#1282
Qwen: Qwen3 30B A3Bqwen
AI 15.3131K ctx$0.1200/M in
#1382
OpenAI: o1openai
AI 30.7200K ctx$15.00/M in
#1482
Google: Gemini 2.5 Flash Lite Preview 09-2025google
AI 19.41M ctx$0.1000/M in
#1582
Qwen: Qwen3 235B A22B Instruct 2507qwen
AI 25262K ctx$0.0900/M in
#1682
Anthropic: Claude 3.5 Sonnetanthropic
AI 15.9200K ctx$6.00/M in
#17Best Value82
DeepSeek: DeepSeek V4 Flashdeepseek
AI 461M ctx$0.0983/M in
#1882
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)google
AI 48.466K ctx$2.00/M in
#1982
xAI: Grok Code Fast 1x-ai
AI 28.7256K ctx$0.2000/M in
#2082
OpenAI: o3 Proopenai
AI 40.7200K ctx$20.00/M in
#2182
Anthropic: Claude 3.7 Sonnet (thinking)anthropic
AI 34.7200K ctx$3.00/M in
#2282
OpenAI: GPT-4o-miniopenai
AI 12.6128K ctx$0.1500/M in
#2382
Qwen3 Coder 480B A35B InstructAlibaba
AI 24.8$1.50/M in
#24New82
MiniMax: MiniMax M3minimax
AI 54.71M ctx$0.3000/M in
#2582
Xiaomi: MiMo-V2-Flashxiaomi
AI 30.3262K ctx$0.1000/M in
#2682
Qwen: Qwen3 VL 8B Thinkingqwen
AI 16.7256K ctx$0.1170/M in
#2782
Qwen: Qwen3 30B A3B Instruct 2507qwen
AI 15131K ctx$0.0482/M in
#2882
Qwen: Qwen3 8Bqwen
AI 10.6131K ctx$0.0500/M in
#2982
Meta: Llama 3.3 70B Instruct (free)meta-llama
AI 14.5131K ctxFree/M in
#3082
Mistral: Mixtral 8x7B Instructmistralai
AI 7.733K ctx$0.5400/M in
#3182
Qwen: Qwen3.6 Plusqwen
AI 501M ctx$0.3250/M in
#3282
inclusionAI: Ling-2.6-flashinclusionai
AI 26.2262K ctx$0.0100/M in
#3382
Qwen: Qwen3 VL 235B A22B Thinkingqwen
AI 27.6131K ctx$0.2600/M in
#3482
MoonshotAI: Kimi K2 0711moonshotai
AI 26.3131K ctx$0.5700/M in
#3582
xAI: Grok 4.20 Multi-Agent Betax-ai
AI 48.52M ctx$2.00/M in
#3682
GPT-5.5 (Non-reasoning)OpenAI
AI 40.9$5.00/M in
#3782
MoonshotAI: Kimi K2.5moonshotai
AI 46.8262K ctx$0.4000/M in
#3882
xAI: Grok 4.1 Fastx-ai
AI 23.62M ctx$0.2000/M in
#3982
OpenAI: GPT-4o Audioopenai
AI 12.8128K ctx$2.50/M in
#4082
xAI: Grok 3 Minix-ai
AI 32.1131K ctx$0.3000/M in
#4182
OpenAI: o3 Mini Highopenai
AI 25.2200K ctx$1.10/M in
#4282
Kwaipilot: KAT-Coder-Pro V2kwaipilot
AI 43.8256K ctx$0.3000/M in
#43New82
Qwen: Qwen3.7 Plusqwen
AI 53.31M ctx$0.3200/M in
#4482
Qwen: Qwen3 VL 8B Instructqwen
AI 14.3256K ctx$0.0800/M in
#4582
Z.ai: GLM 4.5z-ai
AI 26.4131K ctx$0.6000/M in
#4682
Qwen: Qwen3 14Bqwen
AI 16.2132K ctx$0.1000/M in
#4782
IBM: Granite 4.1 8Bibm-granite
AI 12.4131K ctx$0.0500/M in
#4882
Qwen: Qwen3 VL 235B A22B Instructqwen
AI 20.8262K ctx$0.2000/M in
#4982
Mistral: Devstral Mediummistralai
AI 18.7131K ctx$0.4000/M in
#5082
Meta: Llama 4 Maverickmeta-llama
AI 18.41M ctx$0.1500/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 577 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.