Grok 4.20 0309 (Reasoning)

Grok 4.20 0309 (Reasoning)

xAI · Released Mar 10, 2026 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #28 / 279
214.1 tokens / sec
Input #494 / 592
$2.00 per 1M tokens
Output #480 / 592
$6.00 per 1M tokens
Context
— Not reported

Analysis Summary

Grok 4.20 0309 in reasoning mode is a strong performer from xAI's March 2026 release. Its agentic index of 68.7 places it in the top tier for autonomous task execution, and its instruction-following score of 0.829 is among the best in the field. GPQA of 0.885 and HLE of 0.30 confirm deep reasoning capability, while terminalbench of 0.409 and tau2 of 0.965 show reliable tool use and task completion.

For businesses, this model is well-suited to agentic pipelines, complex multi-step reasoning, software engineering assistance, and workflows requiring precise instruction adherence. The long-context reliability score of 0.59 is good, supporting document-heavy tasks. Its intelligence index of 36.5 is strong but sits below the very top frontier models, so for the most demanding reasoning tasks, a higher-tier model may still be preferred.

At $2 input and $6 output per million tokens, pricing is reasonable for the capability level, particularly given the agentic and instruction-following strengths. Teams building autonomous agents or needing reliable tool use at a mid-tier price point will find this a compelling option.

Assessed June 30, 2026

Editorial notes

Grok 4.20 0309 (Reasoning) from xAI delivers strong agentic performance with an index of 68.7, excellent instruction-following at 0.83, and high GPQA and HLE scores, making it a capable choice for complex reasoning and autonomous workflows at a competitive price point.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.8Technical8.1Value6Content8.1
Intelligence 5.8/10
Technical 8.1/10
Content 8.1/10
Value 6/10

How Grok 4.20 0309 (Reasoning) compares

Grok 4.20 0309 (Reasoning) ranks #47 of 385 AI models we track for overall intelligence, #21 of 293 for agentic tasks. At $2.00 per million input tokens it is cheaper than 17% of comparable models.

Performance Indices

Source: Artificial Analysis

36.5 Intelligence Index
68.7 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 88.5% Graduate-level scientific reasoning
HLE 30% Humanity's Last Exam
SciCode 44.7% Scientific computing

Technical

TerminalBench Hard 40.9% Agentic terminal tasks
τ²-Bench 96.5% Conversational agent benchmark

Content

IFBench 82.9% Instruction following
LCR 59% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Grok 4.20 0309 (Reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

ProviderxAI
Release Date March 10, 2026
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $2.00 $0.002000
Output $6.00 $0.006000

Leaderboard Categories

Frequently asked questions about Grok 4.20 0309 (Reasoning)

How much does Grok 4.20 0309 (Reasoning) cost?

Grok 4.20 0309 (Reasoning) costs $2.00 per million input tokens and $6.00 per million output tokens.

Who created Grok 4.20 0309 (Reasoning)?

Grok 4.20 0309 (Reasoning) is developed by xAI and was released on March 10, 2026.