Qwen: Qwen3 Next 80B A3B Thinking

Qwen: Qwen3 Next 80B A3B Thinking

qwen · Released Sep 11, 2025
48
Our Score

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic planning, and reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned to follow complex instructions while reducing repetitive or off-task behavior. The model is suitable for agent frameworks and tool use (function calling), retrieval-heavy workflows, and standardized benchmarking where step-by-step solutions are required. It supports long, detailed completions and leverages throughput-oriented techniques (e.g., multi-token prediction) for faster generation. Note that it operates in thinking-only mode.

$0.10 / 1M Input Price
$0.78 / 1M Output Price
131,072 tokens Context Window
32,768 tokens Max Output
80B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen3
Parameters80B

Performance Indices

Source: Artificial Analysis

26.7 Intelligence Index
19.5 Coding Index
25.7 Agentic Index
84.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 75.9%
Graduate-level scientific reasoning
HLE 11.7%
Humanity's Last Exam
MMLU Pro 82.4%
Multi-task language understanding
LiveCodeBench 78.4%
Live coding evaluation
SciCode 38.8%
Scientific computing
AIME 2025 84.3%
Competition mathematics (2025)
IFBench 60.7%
Instruction following
LCR 60.3%
Long-context reasoning
TerminalBench Hard 9.8%
Agentic terminal tasks
τ²-Bench 41.5%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3-next-80b-a3b-thinking
Providerqwen
Release Date September 11, 2025
Context Length131,072 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000098
Output $0.78 $0.000780

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
247ms
Best Latency (TTFT)
217 tok/s
Best Throughput
1/6
Active Endpoints
Available via: Alibaba, Nebius, Google, Novita, AtlasCloud, Hyperbolic

Leaderboard Categories