Mistral: Devstral Small 1.1

Mistral: Devstral Small 1.1

mistralai · Released Jul 10, 2025
38
Our Score

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and released under the Apache 2.0 license, it features a 128k token context window and supports both Mistral-style function calling and XML output formats. Designed for agentic coding workflows, Devstral Small 1.1 is optimized for tasks such as codebase exploration, multi-file edits, and integration into autonomous development agents like OpenHands and Cline. It achieves 53.6% on SWE-Bench Verified, surpassing all other open models on this benchmark, while remaining lightweight enough to run on a single 4090 GPU or Apple silicon machine. The model uses a Tekken tokenizer with a 131k vocabulary and is deployable via vLLM, Transformers, Ollama, LM Studio, and other OpenAI-compatible runtimes.

$0.10 / 1M Input Price
$0.30 / 1M Output Price
131,072 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerMistral

Performance Indices

Source: Artificial Analysis

15.2 Intelligence Index
12.1 Coding Index
17.3 Agentic Index
29.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 41.4%
Graduate-level scientific reasoning
HLE 3.7%
Humanity's Last Exam
MMLU Pro 62.2%
Multi-task language understanding
LiveCodeBench 25.4%
Live coding evaluation
SciCode 24.3%
Scientific computing
MATH 500 63.5%
Mathematical problem-solving
AIME 0.3%
Competition mathematics
AIME 2025 29.3%
Competition mathematics (2025)
IFBench 34.6%
Instruction following
LCR 17%
Long-context reasoning
TerminalBench Hard 6.1%
Agentic terminal tasks
τ²-Bench 28.4%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID mistralai/devstral-small
Providermistralai
Release Date July 10, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.30 $0.000300

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
438ms
Best Latency (TTFT)
92 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Mistral

Leaderboard Categories