Mistral: Devstral Medium

Mistral: Devstral Medium

mistralai · Released Jul 10, 2025 Efficient
42.7
Our Score

Performance Profile

Intelligence3.6Technical2.8Value7.3Content3.5
Intelligence 3.6/10
Technical 2.8/10
Content 3.5/10
Value 7.3/10

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves 61.6% on SWE-Bench Verified, placing it ahead of Gemini 2.5 Pro and GPT-4.1 in code-related tasks, at a fraction of the cost. It is designed for generalization across prompt styles and tool use in code agents and frameworks. Devstral Medium is available via API only (not open-weight), and supports enterprise deployment on private infrastructure, with optional fine-tuning capabilities.

$0.40 / 1M
Input Price
$2.00 / 1M
Output Price
131,072 tokens
Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerMistral

Performance Indices

Source: Artificial Analysis

18.7 Intelligence Index
15.9 Coding Index
14.5 Agentic Index
4.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 49.2% Graduate-level scientific reasoning
HLE 3.8% Humanity's Last Exam
MMLU Pro 70.8% Multi-task language understanding
MATH 500 70.7% Mathematical problem-solving
AIME 6.7% Competition mathematics
AIME 2025 4.7% Competition mathematics (2025)
SciCode 29.4% Scientific computing

Technical

LiveCodeBench 33.7% Live coding evaluation
TerminalBench Hard 9.1% Agentic terminal tasks
τ²-Bench 19.9% Conversational agent benchmark

Content

IFBench 29.9% Instruction following
LCR 28.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID mistralai/devstral-medium
Providermistralai
Release Date July 10, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.40 $0.000400
Output $2.00 $0.002000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

287ms
Best Latency (TTFT)
40 tok/s
Best Throughput
0/1
Active Endpoints
Available via: Mistral

Leaderboard Categories