MoonshotAI: Kimi K2 Thinking

MoonshotAI: Kimi K2 Thinking

moonshotai · Released Nov 6, 2025 Professional
Intelligence #49 / 523
72.3 Our Score
Speed #146 / 236
72.2 tokens / sec
Input #359 / 523
$0.600 per 1M tokens
Output #376 / 523
$2.50 per 1M tokens
Context #78 / 523
262,144 tokens

Analysis Summary

MoonshotAI: Kimi K2 Thinking sits in the Professional tier on our leaderboard, ranked #49 of 523 published models on overall intelligence. At $0.600 input and $2.50 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, and reasoning.

Editorial notes

MoonshotAI's Kimi K2 Thinking delivers excellent reasoning, strong agentic performance, and impressive maths and coding benchmarks at a very reasonable price point. With tool use, function calling, and a 262K context window, it is a compelling option for businesses needing deep reasoning and agent-based workflows, though Western enterprise support is more limited than top-tier providers.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence7.3Technical7.2Value7Content7.5
Intelligence 7.3/10
Technical 7.2/10
Content 7.5/10
Value 7/10

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

40.9 Intelligence Index
34.8 Coding Index
62.1 Agentic Index
94.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 83.8% Graduate-level scientific reasoning
HLE 22.3% Humanity's Last Exam
MMLU Pro 84.8% Multi-task language understanding
AIME 2025 94.7% Competition mathematics (2025)
SciCode 42.4% Scientific computing

Technical

LiveCodeBench 85.3% Live coding evaluation
TerminalBench Hard 31.1% Agentic terminal tasks
τ²-Bench 93% Conversational agent benchmark

Content

IFBench 68.1% Instruction following
LCR 66.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does MoonshotAI: Kimi K2 Thinking stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID moonshotai/kimi-k2-thinking
Providermoonshotai
Release Date November 6, 2025
Context Length262,144 tokens
Max Completion262,144 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.60 $0.000600
Output $2.50 $0.002500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

90.2%
Avg Uptime
751ms
Best Latency (TTFT)
25 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Novita, Google, AtlasCloud

Leaderboard Categories