MoonshotAI: Kimi K2 0711

MoonshotAI: Kimi K2 0711

moonshotai · Released Jul 11, 2025 Specialist
Intelligence #118 / 525
53.6 Our Score
Speed #222 / 244
34.3 tokens / sec
Input #359 / 525
$0.570 per 1M tokens
Output #374 / 525
$2.30 per 1M tokens
Context #185 / 525
131,072 tokens

Analysis Summary

MoonshotAI: Kimi K2 0711 sits in the Specialist tier on our leaderboard, ranked #118 of 525 published models on overall intelligence. At $0.570 input and $2.30 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

Kimi K2 0711 from MoonshotAI shows a notably strong agentic score relative to its intelligence index, with solid tool use and function calling support making it a reasonable choice for agent workflows; however, limited Western availability may affect business adoption for UK teams.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.1Technical4.6Value6.8Content5
Intelligence 5.1/10
Technical 4.6/10
Content 5/10
Value 6.8/10

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

26.3 Intelligence Index
22.1 Coding Index
38.5 Agentic Index
57 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 76.6% Graduate-level scientific reasoning
HLE 7% Humanity's Last Exam
MMLU Pro 82.4% Multi-task language understanding
MATH 500 97.1% Mathematical problem-solving
AIME 69.3% Competition mathematics
AIME 2025 57% Competition mathematics (2025)
SciCode 34.5% Scientific computing

Technical

LiveCodeBench 55.6% Live coding evaluation
TerminalBench Hard 15.9% Agentic terminal tasks
τ²-Bench 61.1% Conversational agent benchmark

Content

IFBench 41.5% Instruction following
LCR 51% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does MoonshotAI: Kimi K2 0711 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID moonshotai/kimi-k2
Providermoonshotai
Release Date July 11, 2025
Context Length131,072 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.57 $0.000570
Output $2.30 $0.002300

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
1,195ms
Best Latency (TTFT)
26 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Novita

Leaderboard Categories