xAI: Grok 3

xAI: Grok 3

x-ai · Released Jun 10, 2025
60
Our Score

Grok 3 is the latest model from xAI. It's their flagship model that excels at enterprise use cases like data extraction, coding, and text summarization. Possesses deep domain knowledge in finance, healthcare, law, and science.

$3.00 / 1M Input Price
$15.00 / 1M Output Price
131,072 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerGrok

Performance Indices

Source: Artificial Analysis

25.2 Intelligence Index
19.8 Coding Index
30.1 Agentic Index
58 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 69.3%
Graduate-level scientific reasoning
HLE 5.1%
Humanity's Last Exam
MMLU Pro 79.9%
Multi-task language understanding
LiveCodeBench 42.5%
Live coding evaluation
SciCode 36.8%
Scientific computing
MATH 500 87%
Mathematical problem-solving
AIME 33%
Competition mathematics
AIME 2025 58%
Competition mathematics (2025)
IFBench 46.9%
Instruction following
LCR 54.7%
Long-context reasoning
TerminalBench Hard 11.4%
Agentic terminal tasks
τ²-Bench 48.8%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-3
Providerx-ai
Release Date June 10, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $3.00 $0.003000
Output $15.00 $0.015000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
485ms
Best Latency (TTFT)
56 tok/s
Best Throughput
1/2
Active Endpoints
Available via: xAI

Leaderboard Categories