xAI: Grok 4.20

xAI: Grok 4.20

x-ai · Released Mar 31, 2026 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #21 / 278
230.7 tokens / sec
Input #464 / 590
$1.25 per 1M tokens
Output #417 / 590
$2.50 per 1M tokens
Context #2 / 590
2M tokens

Analysis Summary

Grok 4.20 is xAI's benchmarked flagship in this tier, with an intelligence index of 37 and an agentic index of 65.4. Its GPQA score of 0.911 is among the highest in this batch, and the 2M token context window is a standout capability for long-document and codebase analysis. Vision, tool use, and function calling are all supported.

For businesses, this model suits complex reasoning tasks, long-context document workflows, and agentic pipelines where multi-step tool use is required. Instruction-following scores are strong at 0.812, and long-context reasoning is adequate. The lack of a coding index limits confidence in software engineering tasks specifically.

At $1.25 input and $2.50 output, pricing is mid-range. The 2M context window combined with strong reasoning and full tool use makes it a compelling option for teams handling large documents or multi-turn agentic workflows, though frontier-tier models still lead on raw intelligence.

Assessed June 30, 2026

Editorial notes

Grok 4.20 from xAI delivers strong reasoning, a 2M token context window, vision, tool use, and function calling, with a high GPQA score of 0.911 and competitive agentic performance.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6Technical8Value7.3Content7.9
Intelligence 6/10
Technical 8/10
Content 7.9/10
Value 7.3/10

How xAI: Grok 4.20 compares

XAI: Grok 4.20 ranks #46 of 385 AI models we track for overall intelligence, #38 of 293 for agentic tasks. Its 2M-token context window is larger than 100% of the models we list. At $1.25 per million input tokens it is cheaper than 21% of comparable models.

About xAI: Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

37 Intelligence Index
65.4 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 91.1% Graduate-level scientific reasoning
HLE 32.2% Humanity's Last Exam
SciCode 45.6% Scientific computing

Technical

TerminalBench Hard 37.9% Agentic terminal tasks
τ²-Bench 93% Conversational agent benchmark

Content

IFBench 81.2% Instruction following
LCR 58% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does xAI: Grok 4.20 stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID x-ai/grok-4.20
Providerx-ai
Release Date March 31, 2026
Context Length2,000,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.25 $0.001250
Output $2.50 $0.002500

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.9%
Avg Uptime
391ms
Best Latency (TTFT)
227 tok/s
Best Throughput
2/2
Active Endpoints
Available via: xAI

Leaderboard Categories

Frequently asked questions about xAI: Grok 4.20

How much does xAI: Grok 4.20 cost?

xAI: Grok 4.20 costs $1.25 per million input tokens and $2.50 per million output tokens.

What is the context window of xAI: Grok 4.20?

xAI: Grok 4.20 has a context window of 2,000,000 tokens (2M).

What can xAI: Grok 4.20 do?

xAI: Grok 4.20 supports image/vision input, tool use, and function calling.

Who created xAI: Grok 4.20?

xAI: Grok 4.20 is developed by xAI and was released on March 31, 2026.