xAI: Grok 4.20
Analysis Summary
Grok 4.20 is xAI's benchmarked flagship in this tier, with an intelligence index of 37 and an agentic index of 65.4. Its GPQA score of 0.911 is among the highest in this batch, and the 2M token context window is a standout capability for long-document and codebase analysis. Vision, tool use, and function calling are all supported.
For businesses, this model suits complex reasoning tasks, long-context document workflows, and agentic pipelines where multi-step tool use is required. Instruction-following scores are strong at 0.812, and long-context reasoning is adequate. The lack of a coding index limits confidence in software engineering tasks specifically.
At $1.25 input and $2.50 output, pricing is mid-range. The 2M context window combined with strong reasoning and full tool use makes it a compelling option for teams handling large documents or multi-turn agentic workflows, though frontier-tier models still lead on raw intelligence.
Assessed June 30, 2026
Editorial notes
Grok 4.20 from xAI delivers strong reasoning, a 2M token context window, vision, tool use, and function calling, with a high GPQA score of 0.911 and competitive agentic performance.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How xAI: Grok 4.20 compares
XAI: Grok 4.20 ranks #46 of 385 AI models we track for overall intelligence, #38 of 293 for agentic tasks. Its 2M-token context window is larger than 100% of the models we list. At $1.25 per million input tokens it is cheaper than 21% of comparable models.
About xAI: Grok 4.20
Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering..
Capabilities
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does xAI: Grok 4.20 stack up?
Compare side-by-side with other professional models.
Model Information
| OpenRouter ID |
x-ai/grok-4.20
|
| Provider | x-ai |
| Release Date | March 31, 2026 |
| Context Length | 2,000,000 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $1.25 | $0.001250 |
| Output | $2.50 | $0.002500 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about xAI: Grok 4.20
How much does xAI: Grok 4.20 cost?
xAI: Grok 4.20 costs $1.25 per million input tokens and $2.50 per million output tokens.
What is the context window of xAI: Grok 4.20?
xAI: Grok 4.20 has a context window of 2,000,000 tokens (2M).
What can xAI: Grok 4.20 do?
xAI: Grok 4.20 supports image/vision input, tool use, and function calling.
Who created xAI: Grok 4.20?
xAI: Grok 4.20 is developed by xAI and was released on March 31, 2026.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 30, 2026 9:37 pm