xAI: Grok 3

xAI: Grok 3

x-ai · Released Jun 10, 2025 Efficient
48.7
Our Score

Performance Profile

Intelligence4.8Technical3.8Value5.8Content5.5
Intelligence 4.8/10
Technical 3.8/10
Content 5.5/10
Value 5.8/10

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science.

$3.00 / 1M
Input Price
$15.00 / 1M
Output Price
131,072 tokens
Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerGrok

Performance Indices

Source: Artificial Analysis

25.2 Intelligence Index
19.8 Coding Index
30.1 Agentic Index
58 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 69.3% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 79.9% Multi-task language understanding
MATH 500 87% Mathematical problem-solving
AIME 33% Competition mathematics
AIME 2025 58% Competition mathematics (2025)
SciCode 36.8% Scientific computing

Technical

LiveCodeBench 42.5% Live coding evaluation
TerminalBench Hard 11.4% Agentic terminal tasks
τ²-Bench 48.8% Conversational agent benchmark

Content

IFBench 46.9% Instruction following
LCR 54.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-3
Providerx-ai
Release Date June 10, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $3.00 $0.003000
Output $15.00 $0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

95.7%
Avg Uptime
507ms
Best Latency (TTFT)
32 tok/s
Best Throughput
2/2
Active Endpoints
Available via: xAI

Leaderboard Categories