xAI: Grok 4

x-ai · Released Jul 9, 2025 · Professional
Intelligence #52 / 557
73.3 Our Score
Speed #225 / 259
41.6 tokens / sec
Input #505 / 560
$3.00 per 1M tokens
Output #513 / 560
$15.00 per 1M tokens
Context #157 / 560
256,000 tokens

Analysis Summary

xAI: Grok 4 sits in the Professional tier on our leaderboard, ranked #52 of 557 published models on overall intelligence. At $3.00 input and $15.00 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Editorial notes

Grok 4 from xAI delivers strong reasoning, vision support, tool use, and function calling with a 256K context window, backed by standout math and GPQA scores. Pricing sits at the high end (input ranks #505 of 560, output #513 of 560), while the agentic index is competitive for business workflows.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 7.5/10
Technical 7.5/10
Content 7.5/10
Value 6/10

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not..

Capabilities

Tool Use · Function Calling · Vision

Performance Indices

Source: Artificial Analysis

41.5 Intelligence Index
40.5 Coding Index
56.4 Agentic Index
92.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 87.7% Graduate-level scientific reasoning
HLE 23.9% Humanity's Last Exam
MMLU Pro 86.6% Multi-task language understanding
MATH 500 99% Mathematical problem-solving
AIME 94.3% Competition mathematics
AIME 2025 92.7% Competition mathematics (2025)
SciCode 45.7% Scientific computing

Technical

LiveCodeBench 81.9% Live coding evaluation
TerminalBench Hard 37.9% Agentic terminal tasks
τ²-Bench 74.9% Conversational agent benchmark

Content

IFBench 53.7% Instruction following
LCR 68% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-4
Provider x-ai
Release Date July 9, 2025
Context Length 256,000 tokens
Status Active

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $3.00                 $0.003000
Output        $15.00                $0.015000
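
To make the rates above concrete, here is a minimal sketch of a per-request cost estimate. It assumes only the two listed prices ($3.00 input, $15.00 output per 1M tokens) and flat per-token billing. The function name `estimate_cost_usd` and the token counts in the example are hypothetical, and any provider-side surcharges, caching discounts, or separately billed reasoning tokens are not modeled.

```python
from decimal import Decimal

# Listed rates from the pricing table (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("3.00")
OUTPUT_USD_PER_M = Decimal("15.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming flat per-token billing.

    Hypothetical helper: it applies the listed rates and nothing else.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Illustrative request: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost_usd(200_000, 4_000)}")
```

Under these assumptions, a 200,000-token prompt with a 4,000-token reply comes to $0.66, of which $0.60 is input. The same function reproduces the per-1K column: 1,000 input tokens cost $0.003 and 1,000 output tokens cost $0.015.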

Leaderboard Categories