Qwen2.5 72B Instruct

Qwen2.5 72B Instruct

qwen · Released Sep 19, 2024 Efficient
44
Our Score

Performance Profile

Intelligence3.3Technical2.5Value7.5Content5
Intelligence 3.3/10
Technical 2.5/10
Content 5/10
Value 7.5/10

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and..

$0.12 / 1M
Input Price
$0.39 / 1M
Output Price
32,768 tokens
Context Window
16,384 tokens
Max Output
72B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen
Instruct Typechatml
Parameters72B

Performance Indices

Source: Artificial Analysis

15.6 Intelligence Index
11.9 Coding Index
19.5 Agentic Index
14 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 49.1% Graduate-level scientific reasoning
HLE 4.2% Humanity's Last Exam
MMLU Pro 72% Multi-task language understanding
MATH 500 85.8% Mathematical problem-solving
AIME 16% Competition mathematics
AIME 2025 14% Competition mathematics (2025)
SciCode 26.7% Scientific computing

Technical

LiveCodeBench 27.6% Live coding evaluation
TerminalBench Hard 4.5% Agentic terminal tasks
τ²-Bench 34.5% Conversational agent benchmark

Content

IFBench 36.9% Instruction following
LCR 20.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen2.5 72B Instruct stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID qwen/qwen-2.5-72b-instruct
Providerqwen
Release Date September 19, 2024
Context Length32,768 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.12 $0.000120
Output $0.39 $0.000390

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9%
Avg Uptime
576ms
Best Latency (TTFT)
31 tok/s
Best Throughput
1/2
Active Endpoints
Available via: DeepInfra, Novita

Leaderboard Categories