xAI: Grok 4.20 Multi-Agent Beta

x-ai · Released Mar 12, 2026 · Professional
Intelligence #14 / 556
82.7 Our Score
Speed #18 / 257
241.8 tokens / sec
Input #467 / 557
$2.00 per 1M tokens
Output #455 / 557
$6.00 per 1M tokens
Context #2 / 557
2M tokens

Analysis Summary

xAI: Grok 4.20 Multi-Agent Beta sits in the Professional tier on our leaderboard, ranked #14 of 556 published models on overall intelligence. At $2.00 input and $6.00 output per 1M tokens, it ranks #467 and #455 of 557 on input and output price, which places it toward the expensive end of the market. Its 2M-token context window, ranked #2 of 557, suits long-document workflows, and the model supports tool use, vision, and reasoning.

Editorial notes

Grok 4.20 Multi-Agent Beta from xAI combines an intelligence index of 48.5 with a top-tier agentic index of 68.7, a 2M token context, vision, and strong instruction following at $2.00 input per million tokens.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 8.4/10
Technical 8.1/10
Content 8/10
Value 7.3/10

Grok 4.20 Multi-Agent Beta is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks.

The number of agents depends on the reasoning effort setting:
- low / medium: 4 agents
- high / xhigh: 16 agents
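
The effort-to-agent mapping listed above can be written as a small lookup. This is only an illustrative sketch: the names `AGENTS_BY_EFFORT` and `agents_for_effort` are hypothetical and not part of any xAI or OpenRouter API, and the values simply restate the list.

```python
# Agent counts per reasoning effort level, restating the list above.
# The names here are hypothetical helpers, not part of any provider API.
AGENTS_BY_EFFORT = {
    "low": 4,
    "medium": 4,
    "high": 16,
    "xhigh": 16,
}


def agents_for_effort(effort: str) -> int:
    """Return the number of parallel agents for a reasoning effort level."""
    key = effort.strip().lower()
    if key not in AGENTS_BY_EFFORT:
        raise ValueError(f"unknown reasoning effort: {effort!r}")
    return AGENTS_BY_EFFORT[key]


if __name__ == "__main__":
    for level in ("low", "medium", "high", "xhigh"):
        print(level, agents_for_effort(level))
```

A lookup like this can help when budgeting a job, since moving from medium to high effort quadruples the number of agents working in parallel.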

Capabilities

Tool Use · Vision

Performance Indices

Source: Artificial Analysis

48.5 Intelligence Index
42.2 Coding Index
68.7 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 88.5% Graduate-level scientific reasoning
HLE 30% Humanity's Last Exam
SciCode 44.7% Scientific computing

Technical

TerminalBench Hard 40.9% Agentic terminal tasks
τ²-Bench 96.5% Conversational agent benchmark

Content

IFBench 82.9% Instruction following
LCR 59% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-4.20-multi-agent-beta
Provider x-ai
Release Date March 12, 2026
Context Length 2,000,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $2.00 $0.002000
Output $6.00 $0.006000
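
As a rough guide to what these rates mean per request, the sketch below estimates cost from token counts. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes you already know the billed input and output token counts. The listing does not say how tokens consumed by multiple agents are counted, so treat the result as a lower-level arithmetic check rather than a billing prediction.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (from the pricing table above).
INPUT_USD_PER_MILLION = Decimal("2.00")
OUTPUT_USD_PER_MILLION = Decimal("6.00")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in USD from billed token counts.

    Assumption: the caller supplies the billed token totals; how
    multi-agent usage is metered is not stated in the listing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # A full 2,000,000-token context plus 10,000 output tokens.
    print(estimate_cost_usd(2_000_000, 10_000))
```

Under these rates, filling the full 2,000,000-token context costs $4.00 in input alone, and each 10,000 output tokens adds $0.06.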

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime 100%
Best Latency (TTFT) 25,983 ms
Best Throughput 211 tok/s
Active Endpoints 1/1
Available via: xAI

Leaderboard Categories