xAI: Grok 4

Name: xAI: Grok 4 Review
Item: xAI: Grok 4
Author: Design for Online Editorial

▼3GLM 5.2now #8 ▲1GPT-5.5now #7 ▲1Claude Sonnet 5now #5 ▲1Grok 4.5now #6 PRICE DROPQwen3 30B A3B Instruct 2507down 36%PRICE DROPTrinity Large Thinkingdown 12%PRICE DROPNemotron 3 Superdown 11%PRICE DROPKimi K2.6down 20%

xAI: Grok 4

x-ai · Released Jul 9, 2025

Intelligence #9 / 620

82.0 our score

Speed #259 / 289

41.6 tok/s

Input Price #552 / 620

$3.00 per 1M tokens

Output Price #559 / 620

$15.00 per 1M tokens

Context #209 / 620

256,000 tokens

Grok 4 is xAI's capable reasoning model, released in mid-2025, with an intelligence index of 33.3 and an agentic index of 56.4 that places it among the strongest agentic performers in the broader field. Its math index of 92.7, GPQA of 0.877, and livecodebench of 0.819 reflect genuine frontier-adjacent capability across reasoning, science, and coding tasks. Vision support and a 256K context window add further breadth.

For businesses, Grok 4 suits complex software engineering, autonomous agent workflows, long-document analysis, and tasks requiring high-accuracy scientific or mathematical reasoning. The combination of strong agentic reliability, vision, tool use, and function calling makes it a versatile choice for teams building multi-step pipelines.

At $3.00 input and $15.00 output per million tokens, it is priced at the premium tier. Teams where task complexity justifies the cost will find it a strong performer; those with high-volume, routine workloads should consider a cheaper alternative for the bulk of their traffic.

Assessed July 10, 2026

Editorial notes

Grok 4 from xAI delivers strong reasoning, a leading agentic index of 56.4, vision support, and top-tier math and coding benchmarks, with tool use and function calling across a 256K context window.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

DFO Verdict

Grok 4 from xAI delivers strong reasoning, a leading agentic index of 56.4, vision support, and top-tier math and coding benchmarks, with tool use and function calling across a 256K context window.

#9 of 620 overall

Benchmark scores

GPQA Diamond 87.7%

HLE 23.9%

MMLU Pro 86.6%

MATH 500 99%

AIME 94.3%

AIME 2025 92.7%

SciCode 45.7%

LiveCodeBench 81.9%

TerminalBench Hard 37.9%

τ²-Bench 74.9%

IFBench 53.7%

LCR 68%

Magenta = intelligence · Ink = technical/agentic · Cyan = content & long-context · Grey = community benchmarks. Data: Artificial Analysis, Hugging Face.

33.3 Intelligence Index·56.4 Agentic Index·92.7 Math Index

How xAI: Grok 4 compares

XAI: Grok 4 ranks #74 of 395 AI models we track for overall intelligence, #61 of 302 for agentic tasks. Its 256K-token context window is larger than 66% of the models we list. At $3.00 per million input tokens it is cheaper than 11% of comparable models.

Position in the field

Intelligence: smarter than 99% of models #9

Speed: faster than 10% of models #259

Price: cheaper than 11% of models #552

Context: larger than 66% of models #209

worst in fieldmedianbest in field

Price vs frontier peers · $ per 1M tokens

xAI: Grok 4 $3.00 in $15.00 out

Anthropic: Claude Fable 5 $10.00 in $50.00 out

Anthropic: Claude Opus 4.8 $5.00 in $25.00 out

Google: Gemini 3.1 Pro Preview $2.00 in $12.00 out

Dark bar = input · light bar = output, scaled to the priciest peer.

Context window vs peers · tokens

Google: Gemini 3.1 Pro Preview 1M

Anthropic: Claude Fable 5 1M

Anthropic: Claude Opus 4.8 1M

xAI: Grok 4 256K

1M tokens ≈ 8 full-length novels or ~2,500 pages of business documents in a single request.

Performance profile

Strongest on content. The pulled-in intelligence corner is the trade-off, and if the shape matters more than the price, this is your model.

Compare shapes side-by-side →

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$3.00	$0.003000
Output	$15.00	$0.015000

What would xAI: Grok 4 cost your business?

Pick the job that looks most like yours, then fine-tune with the sliders. Estimates update live.

A website chatbot handling around 100 customer conversations a day, a few short messages each.

Requests per month 3,000

One request is one message, email, draft or automation call.

Size of each request 1,200 tokens

$0/mo xAI: Grok 4

$0/mo Anthropic: Claude Fable 5

$0/mo Google: Gemini 3.1 Pro Preview · best value

Full calculator with 620 models → Price Calculator

DFO AI AUTOMATION

These numbers get smaller with the right architecture.

We route routine calls to cheap models and save xAI: Grok 4 for the hard ones. Most clients cut their estimate by 60-80%.

Talk to our team

About xAI: Grok 4

Grok 4 is xAI's latest reasoning model with a 256k context window. It supports parallel tool calling, structured outputs, and both image and text inputs. Note that reasoning is not..

Embed this ranking

Writing about this model? Add the badge to your site. It always shows the current rank and score, and links back to this page.

<a href="https://designforonline.com/ai-models/xai-grok-4/"><img src="https://designforonline.com/?aiml_badge=xai-grok-4&theme=dark" alt="xAI: Grok 4, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

<a href="https://designforonline.com/ai-models/xai-grok-4/"><img src="https://designforonline.com/?aiml_badge=xai-grok-4&theme=light" alt="xAI: Grok 4, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

Frequently asked questions about xAI: Grok 4

How much does xAI: Grok 4 cost?

xAI: Grok 4 costs $3.00 per million input tokens and $15.00 per million output tokens.

What is the context window of xAI: Grok 4?

xAI: Grok 4 has a context window of 256,000 tokens (256K).

What can xAI: Grok 4 do?

xAI: Grok 4 supports image/vision input, tool use, and function calling.

Who created xAI: Grok 4?

xAI: Grok 4 is developed by xAI and was released on July 9, 2025.

Performance profile

Intelligence 5.5

Technical 6.7

Content 7

Value 6

Reasoning: Yes
Input
Output
Context: 256,000 tokens
Tokenizer: Grok
Released: Jul 9, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Model information

Provider x-ai

OpenRouter ID x-ai/grok-4

Status Active

Capabilities

Tool Use Function Calling Vision

Ranked in

AI Agents Coding General Tool Use

External resources View on OpenRouter API access, playground & provider details API Quickstart Sample code and integration guide

Data sourced from the OpenRouter API, Artificial Analysis, the Hugging Face Open LLM Leaderboard and our own internal testing. Scores are editorially curated by our team.

Last updated: July 25, 2026 4:16 pm

Issues with our rankings? Contact us

xAI: Grok 4

DFO Verdict

Benchmark scores

How xAI: Grok 4 compares

Pricing

What would xAI: Grok 4 cost your business?

About xAI: Grok 4

Explore Related Models

Embed this ranking

Frequently asked questions about xAI: Grok 4

How much does xAI: Grok 4 cost?

What is the context window of xAI: Grok 4?

What can xAI: Grok 4 do?

Who created xAI: Grok 4?