MiniMax: MiniMax M2.1

MiniMax: MiniMax M2.1

minimax · Released Dec 23, 2025
78
Our Score

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency. Compared to its predecessor, M2.1 delivers cleaner, more concise outputs and faster perceived response times. It shows leading multilingual coding performance across major systems and application languages, achieving 49.4% on Multi-SWE-Bench and 72.5% on SWE-Bench Multilingual, and serves as a versatile agent “brain” for IDEs, coding tools, and general-purpose assistance. To avoid degrading this model's performance, MiniMax highly recommends preserving reasoning between turns. Learn more about using reasoning_details to pass back reasoning in our docs.

$0.27 / 1M Input Price
$0.95 / 1M Output Price
196,608 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerOther

Performance Indices

Source: Artificial Analysis

39.4 Intelligence Index
32.8 Coding Index
57.1 Agentic Index
82.7 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 83%
Graduate-level scientific reasoning
HLE 22.2%
Humanity's Last Exam
MMLU Pro 87.5%
Multi-task language understanding
LiveCodeBench 81%
Live coding evaluation
SciCode 40.7%
Scientific computing
AIME 2025 82.7%
Competition mathematics (2025)
IFBench 69.9%
Instruction following
LCR 59%
Long-context reasoning
TerminalBench Hard 28.8%
Agentic terminal tasks
τ²-Bench 85.4%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID minimax/minimax-m2.1
Providerminimax
Release Date December 23, 2025
Context Length196,608 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.27 $0.000270
Output $0.95 $0.000950

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7%
Avg Uptime
1,123ms
Best Latency (TTFT)
159 tok/s
Best Throughput
5/10
Active Endpoints
Available via: DeepInfra, AtlasCloud, SiliconFlow, Novita, Nebius, Fireworks, Minimax, Friendli +1 more

Leaderboard Categories