Mistral: Mistral Medium 3

mistralai · Released May 7, 2025 · Efficient
Intelligence #155 / 523
46.9 Our Score
Speed #157 / 236
64.9 tokens / sec
Input #326 / 523
$0.400 per 1M tokens
Output #350 / 523
$2.00 per 1M tokens
Context #183 / 523
131,072 tokens

Analysis Summary

Mistral: Mistral Medium 3 sits in the Efficient tier on our leaderboard, ranked #155 of 523 published models on overall intelligence. At $0.40 input and $2.00 output per 1M tokens, it ranks #326 and #350 of 523 on input and output price, respectively. It offers a 131,072-token context window and supports tool use, function calling, and vision.

Editorial notes

Mistral Medium 3 from Mistral AI offers a solid mid-tier option with tool use, function calling, and vision support across a 131K context window at competitive pricing. Benchmark scores are modest but respectable for a mid-range model, and Mistral's Western-market focus and API reliability make it a practical choice for content and tool-use workflows.

Assessed April 16, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 3.8/10
Technical 2.5/10
Content 5/10
Value 7.3/10

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

Capabilities

Tool Use, Function Calling, Vision

Performance Indices

Source: Artificial Analysis

18.8 Intelligence Index
13.6 Coding Index
14.1 Agentic Index
30.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 57.8% Graduate-level scientific reasoning
HLE 4.3% Humanity's Last Exam
MMLU Pro 76% Multi-task language understanding
MATH 500 90.7% Mathematical problem-solving
AIME 44% Competition mathematics
AIME 2025 30.3% Competition mathematics (2025)
SciCode 33.1% Scientific computing

Technical

LiveCodeBench 40% Live coding evaluation
TerminalBench Hard 3.8% Agentic terminal tasks
τ²-Bench 24.3% Conversational agent benchmark

Content

IFBench 39.3% Instruction following
LCR 28% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID mistralai/mistral-medium-3
Provider mistralai
Model Family Mistral
Release Date May 7, 2025
Context Length 131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.40 $0.000400
Output $2.00 $0.002000
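
To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a single request from the rates in the table above. The function name and the example token counts are hypothetical and chosen only for illustration; actual billing may differ (for example, through rounding or provider-specific fees).

```python
# Prices per 1M tokens, taken from the pricing table above.
INPUT_USD_PER_M = 0.40
OUTPUT_USD_PER_M = 2.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
    cost = estimate_cost_usd(10_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because output tokens cost five times as much as input tokens, the 2,000 completion tokens in this example contribute as much to the total as the 10,000 prompt tokens.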

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime 100%
Best Latency (TTFT) 370ms
Best Throughput 43 tok/s
Active Endpoints 1/1
Available via: Mistral
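
As a rough illustration of what these figures imply for a single response, the sketch below combines the 370 ms best latency (time to first token) with the 43 tok/s best throughput. It assumes a simple linear model (first-token delay plus steady generation) and uses best-case numbers, so real responses will usually be slower; the function name is hypothetical.

```python
# Best-case live metrics listed above.
TTFT_SECONDS = 0.370        # best latency (time to first token)
THROUGHPUT_TOK_PER_S = 43   # best throughput


def estimate_response_seconds(output_tokens: int) -> float:
    """Best-case time to receive a full response (hypothetical helper).

    Assumes the first token arrives after TTFT and the rest stream
    at a constant rate.
    """
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    # Hypothetical 500-token completion.
    print(f"{estimate_response_seconds(500):.1f} s")
```

Under these assumptions, generation time dominates for anything beyond a few dozen tokens; the first-token latency matters mainly for short, interactive replies.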

Leaderboard Categories