Qwen: Qwen3 Max Thinking

Qwen: Qwen3 Max Thinking

qwen · Released Feb 9, 2026 Specialist
61.3
Our Score

Performance Profile

Intelligence6Technical5.3Value7Content6.5
Intelligence 6/10
Technical 5.3/10
Content 6.5/10
Value 7/10

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it..

$0.78 / 1M
Input Price
$3.90 / 1M
Output Price
262,144 tokens
Context Window
32,768 tokens
Max Output

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen

Performance Indices

Source: Artificial Analysis

32.5 Intelligence Index
24.5 Coding Index
50.5 Agentic Index
82.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 77.6% Graduate-level scientific reasoning
HLE 12% Humanity's Last Exam
MMLU Pro 82.4% Multi-task language understanding
AIME 2025 82.3% Competition mathematics (2025)
SciCode 38.7% Scientific computing

Technical

LiveCodeBench 53.5% Live coding evaluation
TerminalBench Hard 17.4% Agentic terminal tasks
τ²-Bench 83.6% Conversational agent benchmark

Content

IFBench 53.8% Instruction following
LCR 57.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 Max Thinking stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-max-thinking
Providerqwen
Release Date February 9, 2026
Context Length262,144 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.78 $0.000780
Output $3.90 $0.003900

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
1,164ms
Best Latency (TTFT)
32 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba

Leaderboard Categories