xAI: Grok 4 Fast

xAI: Grok 4 Fast

x-ai · Released Sep 19, 2025
70
Our Score

Grok 4 Fast is xAI's latest multimodal model with SOTA cost-efficiency and a 2M token context window. It comes in two flavors: non-reasoning and reasoning. Read more about the model on xAI's news post. Reasoning can be enabled/disabled using the reasoning enabled parameter in the API. Learn more in our docs

$0.20 / 1M Input Price
$0.50 / 1M Output Price
2M tokens Context Window
30,000 tokens Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image → Text
TokenizerGrok

Performance Indices

Source: Artificial Analysis

23.1 Intelligence Index
19 Coding Index
37.9 Agentic Index
41.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 60.6%
Graduate-level scientific reasoning
HLE 5%
Humanity's Last Exam
MMLU Pro 73%
Multi-task language understanding
LiveCodeBench 40.1%
Live coding evaluation
SciCode 32.9%
Scientific computing
AIME 2025 41.3%
Competition mathematics (2025)
IFBench 37.7%
Instruction following
LCR 20%
Long-context reasoning
TerminalBench Hard 12.1%
Agentic terminal tasks
τ²-Bench 63.7%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-4-fast
Providerx-ai
Release Date September 19, 2025
Context Length2,000,000 tokens
Max Completion30,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.20 $0.000200
Output $0.50 $0.000500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
2,366ms
Best Latency (TTFT)
159 tok/s
Best Throughput
1/1
Active Endpoints
Available via: xAI

Leaderboard Categories