Mistral: Devstral Medium

Mistral: Devstral Medium

mistralai · Released Jul 10, 2025
44
Our Score

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves 61.6% on SWE-Bench Verified, placing it ahead of Gemini 2.5 Pro and GPT-4.1 in code-related tasks, at a fraction of the cost. It is designed for generalization across prompt styles and tool use in code agents and frameworks. Devstral Medium is available via API only (not open-weight), and supports enterprise deployment on private infrastructure, with optional fine-tuning capabilities.

$0.40 / 1M Input Price
$2.00 / 1M Output Price
131,072 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerMistral

Performance Indices

Source: Artificial Analysis

18.7 Intelligence Index
15.9 Coding Index
14.5 Agentic Index
4.7 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 49.2%
Graduate-level scientific reasoning
HLE 3.8%
Humanity's Last Exam
MMLU Pro 70.8%
Multi-task language understanding
LiveCodeBench 33.7%
Live coding evaluation
SciCode 29.4%
Scientific computing
MATH 500 70.7%
Mathematical problem-solving
AIME 6.7%
Competition mathematics
AIME 2025 4.7%
Competition mathematics (2025)
IFBench 29.9%
Instruction following
LCR 28.7%
Long-context reasoning
TerminalBench Hard 9.1%
Agentic terminal tasks
τ²-Bench 19.9%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID mistralai/devstral-medium
Providermistralai
Release Date July 10, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.40 $0.000400
Output $2.00 $0.002000

Leaderboard Categories