Tool Use

Models with strong tool-use and function-calling support.

Updated July 4, 2026

Models tracked: 302

Providers: 46

Data refresh: Daily

Leading right now: Anthropic: Claude Fable 5 (score 93.3, 1M context, $10.00/1M input)

#	Model	Provider	Badges	Score	AI Index	Context	Input / 1M	Output / 1M
1	Anthropic: Claude Fable 5	anthropic	New, Top Pick	93.3	59.9	1M	$10.00	$50.00
2	Anthropic: Claude Sonnet 5	anthropic	New, In-House Pick	93	53.4	1M	$2.00	$10.00
3	Anthropic: Claude Opus 4.8	anthropic	Top Pick, In-House Pick	92.3	55.7	1M	$5.00	$25.00
4	Google: Gemini 3.1 Pro Preview	google		89.8	46.5	1M	$2.00	$12.00
5	Anthropic: Claude Opus 4.7	anthropic		89.4	53.5	1M	$5.00	$25.00
6	Google: Gemini 3.5 Flash	google		88.9	50.2	1M	$1.50	$9.00
7	OpenAI: GPT-5.5	openai	Top Pick	88.3	54.8	1.1M	$5.00	$30.00
8	OpenAI: GPT-5.4	openai		87	51.4	1.1M	$2.50	$15.00
9	Z.ai: GLM 5.2	z-ai	New	86.1	51.1	1M	$0.7700	$2.42
10	Anthropic: Claude Sonnet 4.6	anthropic		85.1	47.2	1M	$3.00	$15.00
11	Qwen: Qwen3.7 Max	qwen		83.5	46	1M	$1.25	$3.75
12	Anthropic: Claude Opus 4.6	anthropic		83	37.8	1M	$5.00	$25.00
13	DeepSeek: DeepSeek V4 Pro	deepseek		82.1	44.3	1M	$0.4350	$0.8700
14	DeepSeek: DeepSeek V3.2	deepseek		82	24.7	131K	$0.2288	$0.3432
15	DeepSeek: DeepSeek V3.1 Terminus	deepseek		82	21.4	164K	$0.2700	$0.9500
16	Qwen: Qwen3 235B A22B Instruct 2507	qwen		82	19.6	262K	$0.0900	$0.1000
17	Inception: Mercury 2	inception		82	25.3	128K	$0.2500	$0.7500
18	inclusionAI: Ling-2.6-1T (free)	inclusionai		82	33.6	262K	Free	Free
19	Nex AGI: Nex-N2-Pro	nex-agi	New, Best for Agents	82	41	262K	$0.2500	$1.00
20	MoonshotAI: Kimi K2 Thinking	moonshotai		82	32.7	262K	$0.6000	$2.50
21	DeepSeek: DeepSeek V3.1	deepseek		82	20.7	164K	$0.2100	$0.7900
22	xAI: Grok 3 Mini	x-ai		82	22.5	131K	$0.3000	$0.5000
23	Anthropic: Claude 3.7 Sonnet (thinking)	anthropic		82	27.1	200K	$3.00	$15.00
24	Xiaomi: MiMo-V2-Omni	xiaomi		82	35	262K	$0.4000	$2.00
25	Nova 2.0 Lite (medium)	Amazon		82	19		$0.3000	$2.50
26	OpenAI: GPT-5.3-Codex	openai		82	44.3	400K	$1.75	$14.00
27	Qwen: Qwen3 VL 30B A3B Thinking	qwen		82	19.7	131K	$0.1300	$1.56
28	OpenAI: gpt-oss-20b	openai		82	14.9	131K	$0.0290	$0.1400
29	Qwen: Qwen3 8B	qwen		82	8.3	131K	$0.1170	$0.4550
30	OpenAI: o1	openai		82	23.4	200K	$15.00	$60.00
31	Z.ai: GLM 5.1	z-ai		82	40.2	203K	$0.9660	$3.04
32	Qwen3.6 27B (Non-reasoning)	Alibaba		82	29.3		$0.6000	$3.60
33	Qwen: Qwen3.7 Plus	qwen		82	39	1M	$0.3200	$1.28
34	Prime Intellect: INTELLECT-3	prime-intellect		82	15.6	131K	$0.2000	$1.10
35	xAI: Grok 4 Fast	x-ai		82	16.5	2M	$0.2000	$0.5000
36	MoonshotAI: Kimi K2 0711	moonshotai		82	19.4	131K	$0.5700	$2.30
37	Anthropic: Claude 3.5 Haiku	anthropic		82	12.3	200K	$0.8000	$4.00
38	Qwen: Qwen3.5-9B	qwen		82	21.4	262K	$0.1000	$0.1500
39	inclusionAI: Ling-2.6-flash	inclusionai		82	19.3	262K	$0.0100	$0.0300
40	Grok Build 0.1 0616	xAI	New	82	39.8		$1.00	$2.00
41	MiniMax: MiniMax M2.1	minimax		82	31.4	205K	$0.3000	$1.20
42	Amazon: Nova Premier 1.0	amazon		82	12.7	1M	$2.50	$12.50
43	OpenAI: GPT-4o Audio	openai		82	12.8	128K	$2.50	$10.00
44	xAI: Grok 3	x-ai		82	18.4	131K	$3.00	$15.00
45	Mistral: Saba	mistralai		82	6.4	33K	$0.2000	$0.6000
46	Xiaomi: MiMo-V2-Pro	xiaomi		82	40.3	1M	$1.00	$3.00
47	Qwen: Qwen3.6 35B A3B	qwen		82	31.6	262K	$0.1400	$1.00
48	Z.ai: GLM 4.6V	z-ai		82	16.8	131K	$0.3000	$0.9000
49	Qwen: Qwen3 VL 30B A3B Instruct	qwen		82	10	262K	$0.1300	$0.5200
50	Anthropic: Claude Opus 4.1	anthropic		82	33.7	200K	$15.00	$75.00

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).
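To make the "capability per dollar" idea concrete, here is a minimal sketch that divides a model's score by its input price, using four rows copied from the table. The leaderboard does not publish its value formula, so the `score_per_dollar` metric and the function names are assumptions for illustration only.

```python
# Rows copied from the table: (model, score, input price in USD per 1M tokens).
ROWS = [
    ("Anthropic: Claude Fable 5", 93.3, 10.00),
    ("Anthropic: Claude Sonnet 5", 93.0, 2.00),
    ("Z.ai: GLM 5.2", 86.1, 0.77),
    ("DeepSeek: DeepSeek V4 Pro", 82.1, 0.435),
]


def score_per_dollar(score: float, input_price: float) -> float:
    """Hypothetical value metric: score points per dollar of 1M input tokens."""
    if input_price <= 0:
        # Free or unpriced models would need separate handling.
        raise ValueError("input_price must be positive")
    return score / input_price


def rank_by_value(rows):
    """Return rows sorted from best to worst score-per-dollar."""
    return sorted(rows, key=lambda r: score_per_dollar(r[1], r[2]), reverse=True)


for name, score, price in rank_by_value(ROWS):
    print(f"{name}: {score_per_dollar(score, price):.1f} points per dollar")
```

On this simple metric the cheaper models rise to the top, which shows why value is only one of the four dimensions rather than the whole score.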

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.
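As an illustration of the pricing step, the sketch below converts a per-token price into the per-1M figures shown in the table. It assumes the upstream feed reports USD per token as decimal strings; that payload shape, the `SAMPLE` record, its field names and the four-decimal rule for sub-dollar prices (inferred from the table's formatting) are all assumptions rather than documented behaviour.

```python
from decimal import Decimal

# Hypothetical record; the field names and values are illustrative only.
SAMPLE = {
    "id": "example/model",
    "pricing": {"prompt": "0.00001", "completion": "0.00005"},
}


def per_million(price_per_token: str) -> Decimal:
    """Convert a USD-per-token decimal string to USD per 1M tokens."""
    return Decimal(price_per_token) * 1_000_000


def display_price(price_per_token: str) -> str:
    """Format a price the way the table appears to: Free, $0.xxxx or $x.xx."""
    value = per_million(price_per_token)
    if value == 0:
        return "Free"
    if value < 1:
        return f"${value:.4f}"
    return f"${value:.2f}"


print(display_price(SAMPLE["pricing"]["prompt"]))      # input price per 1M
print(display_price(SAMPLE["pricing"]["completion"]))  # output price per 1M
```

`Decimal` is used instead of `float` so that very small per-token prices do not pick up rounding noise when multiplied by one million.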

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.
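The calculation described above can be sketched as a weighted average of the four 0–10 grades, rescaled to 0–100. The actual weights are not published, so the `WEIGHTS` values below are hypothetical placeholders, as are the dimension keys and the sample grades.

```python
# Hypothetical weights in percent; the leaderboard's real weights are not published.
WEIGHTS = {
    "intelligence": 35,
    "technical": 30,
    "content": 20,
    "value": 15,
}


def overall_score(grades: dict) -> float:
    """Combine 0-10 dimension grades into a 0-100 weighted score."""
    for name in WEIGHTS:
        if not 0 <= grades[name] <= 10:
            raise ValueError(f"{name} grade must be between 0 and 10")
    weighted_sum = sum(WEIGHTS[name] * grades[name] for name in WEIGHTS)
    # Dividing by the weight total gives a 0-10 average; multiply by 10 for 0-100.
    return 10 * weighted_sum / sum(WEIGHTS.values())


example = {"intelligence": 9, "technical": 8, "content": 7, "value": 6}
print(overall_score(example))
```

Normalising by the sum of the weights keeps the result on the 0–100 scale even if the weights are later changed and no longer add up to 100.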

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.