Best AI Models for SEO.

Best models for search engine optimisation tasks.

36 Models tracked

14 Providers

Daily Data refresh

1 Editorial picks

Overall AI Agents Coding Content Writing General SEO Tool Use

#	Model		DFO Score ▾	Tok/s	In $/1M	Out $/1M	Ctx
1	Gemma 4 31BGoogle		82.0	35.9	$0.1200	$0.3700	262K
2	Qwen3.5 4B (Non-reasoning)Alibaba		82.0	17.1	$0.0300	$0.1500	–
3	GPT-5 NanoOpenAI		82.0	168	$0.0500	$0.4000	400K
4	Gemma 4 26B A4BGoogle		82.0	65.3	$0.0700	$0.3400	262K
5	Gemini 1.5 Flash (Sep ’24)Google		82.0	–	Free	Free	–
6	Mistral Small 3.2 24BMistral		82.0	145	$0.1000	$0.3000	131K
7	DeepSeek V4 FlashDeepSeek	BEST VALUE	82.0	101	$0.0980	$0.1960	1M
8	Gemini 1.5 Flash-8BGoogle		82.0	–	Free	Free	–
9	GPT-4o-mini Search PreviewOpenAI		82.0	53.0	$0.1500	$0.6000	128K
10	Mistral Large 3Mistral		82.0	51.3	$0.5000	$1.50	–
11	SabaMistral		82.0	–	$0.2000	$0.6000	33K
12	Magistral Small 1.2Mistral		82.0	83.5	$0.5000	$1.50	–
13	Step 3.5 FlashStepFun		82.0	295	$0.1000	$0.3000	262K
14	Magistral Medium 1.2Mistral		82.0	42.8	$2.00	$5.00	–
15	Mistral Small CreativeMistral		82.0	136	$0.1000	$0.3000	33K
16	Ling-2.6-flashinclusionAI		82.0	169	$0.0100	$0.0300	262K
17	GPT-4o-miniOpenAI		82.0	75.3	$0.1500	$0.6000	128K
18	Nova 2.0 Omni (medium)Amazon		82.0	–	$0.3000	$2.50	–
19	Ministral 3 14B 2512Mistral		82.0	132	$0.2000	$0.2000	262K
20	Gemini 3.1 Flash Lite PreviewGoogle		82.0	296	$0.2500	$1.50	1M

Showing 1–20 of 36 · Data from OpenRouter, Artificial Analysis, Hugging Face & our own testing. Scores editorially curated.

We deploy these models for businesses every week. Get a recommendation for your workload.

Get Started

Best models for search engine optimisation tasks.

Leaderboards by use case

The overall table, re-ranked for the job you're hiring a model for.

AI Agents 1. Claude Fable 5 91.5 2. Claude Opus 4.8 89.3 3. Gemini 3.1 Pro Preview 85.4 View 118 models → Coding 1. Claude Fable 5 91.5 2. Claude Opus 4.8 89.3 3. Gemini 3.1 Pro Preview 85.4 View 167 models → Content Writing 1. Claude Fable 5 91.5 2. Claude Opus 4.8 89.3 3. Gemini 3.1 Pro Preview 85.4 View 183 models → General 1. Claude Fable 5 91.5 2. Claude Opus 4.8 89.3 3. Gemini 3.1 Pro Preview 85.4 View 69 models → Tool Use 1. Claude Fable 5 91.5 2. Claude Opus 4.8 89.3 3. Gemini 3.1 Pro Preview 85.4 View 316 models →

How we rank AI models

The Design for Online AI Model Leaderboard scores 612 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. The fourth source is our own: we deploy these models in client agents, chatbots and automations every week, and that internal testing feeds the editorial layer, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.

Where does the data come from?

From four sources: the OpenRouter API, Artificial Analysis, the Hugging Face Open LLM Leaderboard, and internal testing from real deployments by the Design for Online team.