Tool Use

Models with strong tool-use and function-calling support.

Updated July 4, 2026

302 Models tracked

46 Providers

Daily Data refresh

Leading right now Anthropic: Claude Fable 5 1M context · $10.00/1M in 93.3 Score

Models with strong tool-use and function-calling support.

#	Model	Score	AI Index	Context	Input / 1M	Output / 1M
1	Anthropic: Claude Fable 5anthropic New Top Pick	93.3	59.9	1M	$10.00	$50.00
2	Anthropic: Claude Sonnet 5anthropic New In-House Pick	93	53.4	1M	$2.00	$10.00
3	Anthropic: Claude Opus 4.8anthropic Top Pick In-House Pick	92.3	55.7	1M	$5.00	$25.00
4	Google: Gemini 3.1 Pro Previewgoogle	89.8	46.5	1M	$2.00	$12.00
5	Anthropic: Claude Opus 4.7anthropic	89.4	53.5	1M	$5.00	$25.00
6	Google: Gemini 3.5 Flashgoogle	88.9	50.2	1M	$1.50	$9.00
7	OpenAI: GPT-5.5openai Top Pick	88.3	54.8	1.1M	$5.00	$30.00
8	OpenAI: GPT-5.4openai	87	51.4	1.1M	$2.50	$15.00
9	Z.ai: GLM 5.2z-ai New	86.1	51.1	1M	$0.7700	$2.42
10	Anthropic: Claude Sonnet 4.6anthropic	85.1	47.2	1M	$3.00	$15.00
11	Qwen: Qwen3.7 Maxqwen	83.5	46	1M	$1.25	$3.75
12	Anthropic: Claude Opus 4.6anthropic	83	37.8	1M	$5.00	$25.00
13	DeepSeek: DeepSeek V4 Prodeepseek	82.1	44.3	1M	$0.4350	$0.8700
14	OpenAI: GPT-5.3-Codexopenai	82	44.3	400K	$1.75	$14.00
15	Qwen: Qwen3 VL 30B A3B Thinkingqwen	82	19.7	131K	$0.1300	$1.56
16	OpenAI: gpt-oss-20bopenai	82	14.9	131K	$0.0290	$0.1400
17	Qwen: Qwen3 8Bqwen	82	8.3	131K	$0.1170	$0.4550
18	OpenAI: o1openai	82	23.4	200K	$15.00	$60.00
19	Z.ai: GLM 5.1z-ai	82	40.2	203K	$0.9660	$3.04
20	Qwen3.6 27B (Non-reasoning)Alibaba	82	29.3		$0.6000	$3.60
21	Qwen: Qwen3.7 Plusqwen	82	39	1M	$0.3200	$1.28
22	Prime Intellect: INTELLECT-3prime-intellect	82	15.6	131K	$0.2000	$1.10
23	xAI: Grok 4 Fastx-ai	82	16.5	2M	$0.2000	$0.5000
24	MoonshotAI: Kimi K2 0711moonshotai	82	19.4	131K	$0.5700	$2.30
25	Anthropic: Claude 3.5 Haikuanthropic	82	12.3	200K	$0.8000	$4.00
26	Qwen: Qwen3.5-9Bqwen	82	21.4	262K	$0.1000	$0.1500
27	inclusionAI: Ling-2.6-flashinclusionai	82	19.3	262K	$0.0100	$0.0300
28	Grok Build 0.1 0616xAI New	82	39.8		$1.00	$2.00
29	MiniMax: MiniMax M2.1minimax	82	31.4	205K	$0.3000	$1.20
30	Amazon: Nova Premier 1.0amazon	82	12.7	1M	$2.50	$12.50
31	OpenAI: GPT-4o Audioopenai	82	12.8	128K	$2.50	$10.00
32	xAI: Grok 3x-ai	82	18.4	131K	$3.00	$15.00
33	Mistral: Sabamistralai	82	6.4	33K	$0.2000	$0.6000
34	Xiaomi: MiMo-V2-Proxiaomi	82	40.3	1M	$1.00	$3.00
35	Qwen: Qwen3.6 35B A3Bqwen	82	31.6	262K	$0.1400	$1.00
36	Z.ai: GLM 4.6Vz-ai	82	16.8	131K	$0.3000	$0.9000
37	Qwen: Qwen3 VL 30B A3B Instructqwen	82	10	262K	$0.1300	$0.5200
38	Anthropic: Claude Opus 4.1anthropic	82	33.7	200K	$15.00	$75.00
39	Qwen: Qwen3 14Bqwen	82	10.4	132K	$0.1000	$0.2400
40	Meta: Llama 3.3 70B Instruct (free)meta-llama	82	9.4	131K	Free	Free
41	StepFun: Step 3.5 Flashstepfun	82	26	262K	$0.1000	$0.3000
42	Anthropic: Claude Opus 4.5anthropic	82	34.7	200K	$5.00	$25.00
43	Qwen: Qwen3 Next 80B A3B Thinkingqwen	82	16.7	262K	$0.0975	$0.7800
44	Mistral: Devstral Mediummistralai	82	12.4	131K	$0.4000	$2.00
45	Meta: Llama 4 Maverickmeta-llama	82	14.3	1M	$0.1500	$0.6000
46	Anthropic: Claude 3.5 Sonnetanthropic	82	9.9	200K	$6.00	$30.00
47	DeepSeek: DeepSeek V4 Flashdeepseek Best Value	82	40.3	1M	$0.0900	$0.1800
48	IBM: Granite 4.1 8Bibm-granite	82	6.7	131K	$0.0500	$0.1000
49	Z.ai: GLM 4.7z-ai	82	33.7	203K	$0.4000	$1.75
50	NVIDIA: Nemotron Nano 12B 2 VLnvidia	82	4.6	131K	$0.2000	$0.6000

Anthropic: Claude Fable 5anthropic

AI 59.91M ctx$10.00/M in

Anthropic: Claude Sonnet 5anthropic

AI 53.41M ctx$2.00/M in

Anthropic: Claude Opus 4.8anthropic

AI 55.71M ctx$5.00/M in

Google: Gemini 3.1 Pro Previewgoogle

AI 46.51M ctx$2.00/M in

Anthropic: Claude Opus 4.7anthropic

AI 53.51M ctx$5.00/M in

Google: Gemini 3.5 Flashgoogle

AI 50.21M ctx$1.50/M in

OpenAI: GPT-5.5openai

AI 54.81.1M ctx$5.00/M in

OpenAI: GPT-5.4openai

AI 51.41.1M ctx$2.50/M in

Z.ai: GLM 5.2z-ai

AI 51.11M ctx$0.7700/M in

Anthropic: Claude Sonnet 4.6anthropic

AI 47.21M ctx$3.00/M in

Qwen: Qwen3.7 Maxqwen

AI 461M ctx$1.25/M in

Anthropic: Claude Opus 4.6anthropic

AI 37.81M ctx$5.00/M in

DeepSeek: DeepSeek V4 Prodeepseek

AI 44.31M ctx$0.4350/M in

OpenAI: GPT-5.3-Codexopenai

AI 44.3400K ctx$1.75/M in

Qwen: Qwen3 VL 30B A3B Thinkingqwen

AI 19.7131K ctx$0.1300/M in

OpenAI: gpt-oss-20bopenai

AI 14.9131K ctx$0.0290/M in

Qwen: Qwen3 8Bqwen

AI 8.3131K ctx$0.1170/M in

OpenAI: o1openai

AI 23.4200K ctx$15.00/M in

Z.ai: GLM 5.1z-ai

AI 40.2203K ctx$0.9660/M in

Qwen3.6 27B (Non-reasoning)Alibaba

AI 29.3$0.6000/M in

Qwen: Qwen3.7 Plusqwen

AI 391M ctx$0.3200/M in

Prime Intellect: INTELLECT-3prime-intellect

AI 15.6131K ctx$0.2000/M in

xAI: Grok 4 Fastx-ai

AI 16.52M ctx$0.2000/M in

MoonshotAI: Kimi K2 0711moonshotai

AI 19.4131K ctx$0.5700/M in

Anthropic: Claude 3.5 Haikuanthropic

AI 12.3200K ctx$0.8000/M in

Qwen: Qwen3.5-9Bqwen

AI 21.4262K ctx$0.1000/M in

inclusionAI: Ling-2.6-flashinclusionai

AI 19.3262K ctx$0.0100/M in

Grok Build 0.1 0616xAI

AI 39.8$1.00/M in

MiniMax: MiniMax M2.1minimax

AI 31.4205K ctx$0.3000/M in

Amazon: Nova Premier 1.0amazon

AI 12.71M ctx$2.50/M in

OpenAI: GPT-4o Audioopenai

AI 12.8128K ctx$2.50/M in

xAI: Grok 3x-ai

AI 18.4131K ctx$3.00/M in

Mistral: Sabamistralai

AI 6.433K ctx$0.2000/M in

Xiaomi: MiMo-V2-Proxiaomi

AI 40.31M ctx$1.00/M in

Qwen: Qwen3.6 35B A3Bqwen

AI 31.6262K ctx$0.1400/M in

Z.ai: GLM 4.6Vz-ai

AI 16.8131K ctx$0.3000/M in

Qwen: Qwen3 VL 30B A3B Instructqwen

AI 10262K ctx$0.1300/M in

Anthropic: Claude Opus 4.1anthropic

AI 33.7200K ctx$15.00/M in

Qwen: Qwen3 14Bqwen

AI 10.4132K ctx$0.1000/M in

Meta: Llama 3.3 70B Instruct (free)meta-llama

AI 9.4131K ctxFree/M in

StepFun: Step 3.5 Flashstepfun

AI 26262K ctx$0.1000/M in

Anthropic: Claude Opus 4.5anthropic

AI 34.7200K ctx$5.00/M in

Qwen: Qwen3 Next 80B A3B Thinkingqwen

AI 16.7262K ctx$0.0975/M in

Mistral: Devstral Mediummistralai

AI 12.4131K ctx$0.4000/M in

Meta: Llama 4 Maverickmeta-llama

AI 14.31M ctx$0.1500/M in

Anthropic: Claude 3.5 Sonnetanthropic

AI 9.9200K ctx$6.00/M in

DeepSeek: DeepSeek V4 Flashdeepseek

AI 40.31M ctx$0.0900/M in

IBM: Granite 4.1 8Bibm-granite

AI 6.7131K ctx$0.0500/M in

Z.ai: GLM 4.7z-ai

AI 33.7203K ctx$0.4000/M in

NVIDIA: Nemotron Nano 12B 2 VLnvidia

AI 4.6131K ctx$0.2000/M in

How we rank AI models

The Design for Online AI Model Leaderboard scores 592 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.