xAI: Grok 4.20 Multi-Agent Beta

x-ai · Released Mar 12, 2026 · Professional
Intelligence #14 / 523
82.7 Our Score
Speed #14 / 244
241.8 tokens / sec
Input #439 / 525
$2.00 per 1M tokens
Output #428 / 525
$6.00 per 1M tokens
Context #1 / 525
2M tokens

Analysis Summary

xAI: Grok 4.20 Multi-Agent Beta sits in the Professional tier on our leaderboard, ranked #14 of 523 published models on overall intelligence. At $2.00 input and $6.00 output per 1M tokens, it ranks #439 of 525 on input price and #428 of 525 on output price, toward the expensive end of the market. Its 2M-token context window, ranked #1 of 525, suits long-document workflows, and the model supports tool use, vision, and reasoning.

Editorial notes

xAI's Grok 4.20 Multi-Agent Beta is a multi-agent model with a 2M-token context window, strong reasoning, vision support, and top-tier agentic scores (68.7 Agentic Index, 96.5% on τ²-Bench). Its premium pricing ($2/$6 per million input/output tokens) positions it for enterprise use cases where context depth and agent reliability justify the cost.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 8.4/10
Technical 8.1/10
Content 8/10
Value 7.3/10

Grok 4.20 Multi-Agent Beta is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks.

Reasoning effort behavior:
- low / medium: 4 agents
- high / xhigh: 16 agents
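
The effort-to-agent mapping above can be written as a small lookup. This is a minimal sketch for illustration only: `AGENTS_BY_EFFORT` and `agents_for_effort` are hypothetical names, not part of any xAI or OpenRouter API.

```python
# Hypothetical lookup mirroring the list above; not an official API.
AGENTS_BY_EFFORT = {
    "low": 4,
    "medium": 4,
    "high": 16,
    "xhigh": 16,
}


def agents_for_effort(effort: str) -> int:
    """Return the number of parallel agents for a reasoning effort level."""
    key = effort.strip().lower()
    if key not in AGENTS_BY_EFFORT:
        raise ValueError(f"unknown reasoning effort: {effort!r}")
    return AGENTS_BY_EFFORT[key]
```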

Capabilities

Tool Use · Vision

Performance Indices

Source: Artificial Analysis

48.5 Intelligence Index
42.2 Coding Index
68.7 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 88.5% Graduate-level scientific reasoning
HLE 30% Humanity's Last Exam
SciCode 44.7% Scientific computing

Technical

TerminalBench Hard 40.9% Agentic terminal tasks
τ²-Bench 96.5% Conversational agent benchmark

Content

IFBench 82.9% Instruction following
LCR 59% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID: x-ai/grok-4.20-multi-agent-beta
Provider: x-ai
Release Date: March 12, 2026
Context Length: 2,000,000 tokens
Status: Active

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $2.00                 $0.002000
Output        $6.00                 $0.006000
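
As a rough illustration of how the listed prices translate into a per-request cost, the sketch below multiplies token counts by the per-1M rates above. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is a simple linear function of the token counts passed in. The listing does not say whether multi-agent runs bill extra tokens per agent, so that is not modeled.

```python
from decimal import Decimal

# List prices from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_1M = Decimal("2.00")
OUTPUT_USD_PER_1M = Decimal("6.00")
TOKENS_PER_1M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in USD, assuming simple linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_1M
        + Decimal(output_tokens) * OUTPUT_USD_PER_1M
    )
    return total / TOKENS_PER_1M
```

Under these assumptions, a 500,000-token prompt with a 20,000-token reply comes to $1.12, and filling the full 2,000,000-token context costs $4.00 in input alone.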

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Best Latency (TTFT): 14,797 ms
Best Throughput: 257.5 tok/s
Active Endpoints: 0/1
Available via: xAI
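
To get a feel for what these live figures mean for a single response, the sketch below adds the best time to first token to the time needed to stream the output at the best throughput. The function name `estimate_response_seconds` is hypothetical, and the result is only a best-case bound: the two metrics may not have been measured on the same request, and real latency varies with load and reasoning effort.

```python
# Best-case live metrics from the listing above.
BEST_TTFT_MS = 14_797
BEST_THROUGHPUT_TOK_PER_S = 257.5


def estimate_response_seconds(output_tokens: int) -> float:
    """Best-case wall-clock estimate: TTFT plus streaming time for the output."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return BEST_TTFT_MS / 1000 + output_tokens / BEST_THROUGHPUT_TOK_PER_S
```

With these numbers, a 2,575-token reply would take roughly 24.8 seconds, of which about 14.8 seconds is the wait for the first token.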

Leaderboard Categories