Home > AI Models > xAI: Grok 4.20

xAI: Grok 4.20

Name: xAI: Grok 4.20 Review
Item: xAI: Grok 4.20
Author: Design for Online Editorial

PRICE DROPQwen3.6 27Bdown 33%PRICE DROPQwen3 Next 80B A3B Instructdown 33%PRICE DROPNemotron 3 Ultradown 39%PRICE DROPQwen3.5-27Bdown 40%PRICE DROPGemma 3 27Bdown 20%PRICE DROPMiniMax M2down 15%

xAI: Grok 4.20

x-ai · Released Mar 31, 2026

Intelligence #9 / 612

82.0 our score

Speed #61 / 288

151.4 tok/s

Input Price #481 / 620

$1.25 per 1M tokens

Output Price #428 / 620

$2.50 per 1M tokens

Context #1 / 620

2M tokens

Grok 4.20 is an earlier-generation xAI model with a 2 million token context window, vision, tool use, and function calling. The intelligence index of 21.8 is in the limited range, and the agentic index of 38.3 is lower than the top agentic models in the landscape, including the later Grok 4.20 Multi-Agent Beta which scores 48.5 on intelligence.

For businesses, the very large context window is a genuine asset for long-document workflows, and the multimodal input (text, image, file) adds flexibility. However, reasoning depth and agentic reliability lag behind current-generation models, which limits its suitability for complex autonomous tasks or high-stakes outputs.

At $1.25 input and $2.50 output, pricing is moderate. Teams needing a large-context xAI model for lighter reasoning or document-processing tasks may find it useful, but the newer Grok 4.20 Multi-Agent Beta or Grok 4.5 are stronger choices where budget allows.

Assessed July 10, 2026

Editorial notes

Grok 4.20 from xAI has a 2M token context, vision, and tool use, but an intelligence index of 21.8 and agentic index of 38.3 place it well below the current xAI flagship tier.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

DFO Verdict

Grok 4.20 from xAI has a 2M token context, vision, and tool use, but an intelligence index of 21.8 and agentic index of 38.3 place it well below the current xAI flagship tier.

#9 of 612 overall

Benchmark scores

GPQA Diamond 77.6%

HLE 24.2%

SciCode 32.8%

TerminalBench Hard 16.7%

τ²-Bench 59.9%

IFBench 49.3%

LCR 17.3%

Magenta = intelligence · Ink = technical/agentic · Cyan = content & long-context · Grey = community benchmarks. Data: Artificial Analysis, Hugging Face.

21.8 Intelligence Index·38.3 Agentic Index

How xAI: Grok 4.20 compares

XAI: Grok 4.20 ranks #135 of 394 AI models we track for overall intelligence, #105 of 301 for agentic tasks. Its 2M-token context window is larger than 100% of the models we list. At $1.25 per million input tokens it is cheaper than 22% of comparable models.

Position in the field

Intelligence: smarter than 99% of models #9

Speed: faster than 79% of models #61

Price: cheaper than 22% of models #481

Context: larger than 100% of models #1

worst in fieldmedianbest in field

Price vs frontier peers · $ per 1M tokens

xAI: Grok 4.20 $1.25 in $2.50 out

Anthropic: Claude Fable 5 $10.00 in $50.00 out

Anthropic: Claude Opus 4.8 $5.00 in $25.00 out

Google: Gemini 3.1 Pro Preview $2.00 in $12.00 out

Dark bar = input · light bar = output, scaled to the priciest peer.

Context window vs peers · tokens

xAI: Grok 4.20 2M

Google: Gemini 3.1 Pro Preview 1M

Anthropic: Claude Fable 5 1M

Anthropic: Claude Opus 4.8 1M

1M tokens ≈ 8 full-length novels or ~2,500 pages of business documents in a single request.

Performance profile

Strongest on value. The pulled-in content corner is the trade-off, and if the shape matters more than the price, this is your model.

Compare shapes side-by-side →

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$1.25	$0.001250
Output	$2.50	$0.002500

What would xAI: Grok 4.20 cost your business?

Pick the job that looks most like yours, then fine-tune with the sliders. Estimates update live.

A website chatbot handling around 100 customer conversations a day, a few short messages each.

Requests per month 3,000

One request is one message, email, draft or automation call.

Size of each request 1,200 tokens

$0/mo xAI: Grok 4.20

$0/mo Anthropic: Claude Fable 5

$0/mo Z.ai: GLM 5.2 · best value

Full calculator with 620 models → Price Calculator

DFO AI AUTOMATION

These numbers get smaller with the right architecture.

We route routine calls to cheap models and save xAI: Grok 4.20 for the hard ones. Most clients cut their estimate by 60-80%.

Talk to our team

About xAI: Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering..

Embed this ranking

Writing about this model? Add the badge to your site. It always shows the current rank and score, and links back to this page.

<a href="https://designforonline.com/ai-models/xai-grok-4-20/"><img src="https://designforonline.com/?aiml_badge=xai-grok-4-20&theme=dark" alt="xAI: Grok 4.20, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

<a href="https://designforonline.com/ai-models/xai-grok-4-20/"><img src="https://designforonline.com/?aiml_badge=xai-grok-4-20&theme=light" alt="xAI: Grok 4.20, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

Frequently asked questions about xAI: Grok 4.20

How much does xAI: Grok 4.20 cost?

xAI: Grok 4.20 costs $1.25 per million input tokens and $2.50 per million output tokens.

What is the context window of xAI: Grok 4.20?

xAI: Grok 4.20 has a context window of 2,000,000 tokens (2M).

What can xAI: Grok 4.20 do?

xAI: Grok 4.20 supports image/vision input, tool use, and function calling.

Who created xAI: Grok 4.20?

xAI: Grok 4.20 is developed by xAI and was released on March 31, 2026.

Performance profile

Intelligence 4

Technical 4.2

Content 3.8

Value 7.3

Reasoning: Yes
Input
Output
Context: 2M tokens
Tokenizer: Grok
Released: Mar 31, 2026

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Model information

Provider x-ai

OpenRouter ID x-ai/grok-4.20

Status Active

Capabilities

Tool Use Function Calling Vision

Ranked in

Tool Use

Live performance · 30 min refresh

100% Avg uptime

349ms Best latency

168 tok/s Best throughput

4/4 Active endpoints

External resources View on OpenRouter API access, playground & provider details API Quickstart Sample code and integration guide

Data sourced from the OpenRouter API, Artificial Analysis, the Hugging Face Open LLM Leaderboard and our own internal testing. Scores are editorially curated by our team.

Last updated: July 24, 2026 8:38 pm

Issues with our rankings? Contact us

xAI: Grok 4.20

DFO Verdict

Benchmark scores

How xAI: Grok 4.20 compares

Pricing

What would xAI: Grok 4.20 cost your business?

About xAI: Grok 4.20

Explore Related Models

Embed this ranking

Frequently asked questions about xAI: Grok 4.20

How much does xAI: Grok 4.20 cost?

What is the context window of xAI: Grok 4.20?

What can xAI: Grok 4.20 do?

Who created xAI: Grok 4.20?