xAI: Grok 4.20 Multi-Agent Beta

xAI: Grok 4.20 Multi-Agent Beta

x-ai · Released Mar 12, 2026 Professional
Intelligence #21 / 583
77.0 Our Score
Speed #21 / 278
241.8 tokens / sec
Input #490 / 586
$2.00 per 1M tokens
Output #476 / 586
$6.00 per 1M tokens
Context #2 / 586
2M tokens

Analysis Summary

Grok 4.20 Multi-Agent Beta is xAI's specialist agentic variant, with an intelligence index of 48.5 and an agentic index of 68.7 that places it among the stronger multi-step reasoning models in the field. Vision and tool use are supported, and the 2 million token context window is one of the largest available, enabling analysis of very long documents, codebases, or conversation histories in a single pass.

For businesses, the model is well suited to autonomous agent pipelines, long-document analysis, and complex reasoning tasks where context depth matters. Instruction following is strong (ifbench 0.829), and the tau2 agentic reliability score is high. The coding index of 42.2 is capable but not class-leading, so pure software engineering tasks may benefit from a more coding-focused model.

At $2.00 input and $6.00 output per million tokens, it is priced at the premium end. The combination of a massive context window, strong agentic performance, and vision support makes it a compelling choice for businesses building sophisticated agent workflows where context capacity is a bottleneck.

Assessed June 6, 2026

Editorial notes

Grok 4.20 Multi-Agent Beta from xAI combines an excellent intelligence index with a strong agentic score, vision support, a 2M token context window, and high instruction-following accuracy for complex multi-step business workflows.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence7Technical7.4Value7.3Content8.2
Intelligence 7/10
Technical 7.4/10
Content 8.2/10
Value 7.3/10

How xAI: Grok 4.20 Multi-Agent Beta compares

XAI: Grok 4.20 Multi-Agent Beta ranks #10 of 382 AI models we track for overall intelligence, #41 of 111 for coding, #22 of 293 for agentic tasks. Its 2M-token context window is larger than 100% of the models we list. At $2.00 per million input tokens it is cheaper than 16% of comparable models.

About xAI: Grok 4.20 Multi-Agent Beta

Grok 4.20 Multi-Agent Beta is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks. Reasoning effort behavior:
- low / medium: 4 agents
- high / xhigh: 16 agents

Capabilities

Tool Use Vision

Performance Indices

Source: Artificial Analysis

48.5 Intelligence Index
42.2 Coding Index
68.7 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 88.5% Graduate-level scientific reasoning
HLE 30% Humanity's Last Exam
SciCode 44.7% Scientific computing

Technical

TerminalBench Hard 40.9% Agentic terminal tasks
τ²-Bench 96.5% Conversational agent benchmark

Content

IFBench 82.9% Instruction following
LCR 59% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does xAI: Grok 4.20 Multi-Agent Beta stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID x-ai/grok-4.20-multi-agent-beta
Providerx-ai
Release Date March 12, 2026
Context Length2,000,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $2.00 $0.002000
Output $6.00 $0.006000

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

4,561ms
Best Latency (TTFT)
563 tok/s
Best Throughput
0/2
Active Endpoints
Available via: xAI

Leaderboard Categories

Frequently asked questions about xAI: Grok 4.20 Multi-Agent Beta

How much does xAI: Grok 4.20 Multi-Agent Beta cost?

xAI: Grok 4.20 Multi-Agent Beta costs $2.00 per million input tokens and $6.00 per million output tokens.

What is the context window of xAI: Grok 4.20 Multi-Agent Beta?

xAI: Grok 4.20 Multi-Agent Beta has a context window of 2,000,000 tokens (2M).

Is xAI: Grok 4.20 Multi-Agent Beta good for coding?

On our coding benchmark index, xAI: Grok 4.20 Multi-Agent Beta ranks #41 of 111 models, placing it in the broader range of the field for code generation and debugging.

What can xAI: Grok 4.20 Multi-Agent Beta do?

xAI: Grok 4.20 Multi-Agent Beta supports image/vision input and tool use.

Who created xAI: Grok 4.20 Multi-Agent Beta?

xAI: Grok 4.20 Multi-Agent Beta is developed by xAI and was released on March 12, 2026.