xAI: Grok 3 Mini

x-ai · Released Jun 10, 2025 · Specialist
Intelligence #77 / 523
63.0 Our Score
Speed #17 / 236
215.4 tokens / sec
Input #298 / 523
$0.300 per 1M tokens
Output #231 / 523
$0.500 per 1M tokens
Context #183 / 523
131,072 tokens

Analysis Summary

xAI: Grok 3 Mini sits in the Specialist tier on our leaderboard, ranked #77 of 523 published models on overall intelligence. At $0.300 input and $0.500 output per 1M tokens, it is priced near the middle of the field (input #298 and output #231 of 523). It offers a 131,072-token context window and supports tool use, function calling, and reasoning.

Editorial notes

xAI's Grok 3 Mini is a capable and affordable model. Its agentic and mathematics scores (τ²-Bench 90.4%, MATH 500 99.2%) are high for a $0.30/1M input price, and it performs well on live coding evaluations (LiveCodeBench 69.6%). It is a compelling value option for businesses that need solid reasoning and agentic capability without the cost of frontier flagships.

Assessed April 16, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 5.9/10
Technical 5.6/10
Content 6/10
Value 7.8/10

A lightweight model that thinks before responding. Fast, smart, and great for logic-based tasks that do not require deep domain knowledge. The raw thinking traces are accessible.

Capabilities

Tool Use · Function Calling

Performance Indices

Source: Artificial Analysis

32.1 Intelligence Index
25.2 Coding Index
53.9 Agentic Index
84.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 79.1% Graduate-level scientific reasoning
HLE 11.1% Humanity's Last Exam
MMLU Pro 82.8% Multi-task language understanding
MATH 500 99.2% Mathematical problem-solving
AIME 93.3% Competition mathematics
AIME 2025 84.7% Competition mathematics (2025)
SciCode 40.6% Scientific computing

Technical

LiveCodeBench 69.6% Live coding evaluation
TerminalBench Hard 17.4% Agentic terminal tasks
τ²-Bench 90.4% Conversational agent benchmark

Content

IFBench 45.9% Instruction following
LCR 50.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID x-ai/grok-3-mini
Provider x-ai
Release Date June 10, 2025
Context Length 131,072 tokens
Status Active

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $0.30                 $0.000300
Output        $0.50                 $0.000500
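
As a rough illustration of how the listed rates translate into per-request cost, the sketch below multiplies token counts by the per-1M prices from the pricing table. The function name `estimate_cost` and the example token counts are hypothetical, and the sketch assumes list prices with no caching discounts or other billing adjustments.

```python
from decimal import Decimal

# List prices from the pricing table (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.30")
OUTPUT_PRICE_PER_M = Decimal("0.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at list prices (hypothetical helper)."""
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
    print(estimate_cost(10_000, 2_000))
```

Decimal arithmetic is used here so that very small per-token amounts are not distorted by floating-point rounding when many requests are summed.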

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9% Avg Uptime
370ms Best Latency (TTFT)
75 tok/s Best Throughput
1/2 Active Endpoints
Available via: xAI
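
To give a feel for what the live figures imply, here is a minimal sketch that estimates end-to-end response time as time to first token plus generation time. It assumes the best-case live values above (370 ms TTFT, 75 tok/s) hold for the whole response, which real traffic will often not match; the function name `estimate_response_seconds` is hypothetical. The 215.4 tokens/sec speed figure in the page header is a separate measurement and is not used here.

```python
# Best-case figures from the live endpoint metrics.
BEST_TTFT_SECONDS = 0.370     # 370 ms time to first token
BEST_THROUGHPUT_TPS = 75.0    # tokens generated per second


def estimate_response_seconds(output_tokens: int) -> float:
    """Rough best-case wall-clock time for a response (hypothetical helper)."""
    generation_seconds = output_tokens / BEST_THROUGHPUT_TPS
    return BEST_TTFT_SECONDS + generation_seconds


if __name__ == "__main__":
    # Hypothetical 750-token response: about 10 s of generation plus the TTFT.
    print(f"{estimate_response_seconds(750):.2f} s")
```

Because the model reasons before answering, any thinking tokens it emits would add to the output token count fed into this estimate.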

Leaderboard Categories