Home > AI Models > Qwen2.5 72B Instruct

Qwen2.5 72B Instruct

Name: Qwen2.5 72B Instruct Review
Item: Qwen2.5 72B Instruct
Author: Design for Online Editorial

Qwen2.5 72B Instruct

qwen · Released Sep 19, 2024 Professional

Intelligence #14 / 590

82.0 Our Score

Speed #198 / 279

55.7 tokens / sec

Input #359 / 592

$0.360 per 1M tokens

Output #229 / 592

$0.400 per 1M tokens

Context #245 / 592

131,072 tokens

Qwen2.5 72B Instruct is Alibaba's mid-2024 flagship open-weight model, offering tool use and function calling alongside a 131K context window. Its intelligence index of 9.6 places it in the lower tier of benchmarked models, but its MMLU-Pro score of 0.72 and GPQA of 0.491 indicate reasonable general knowledge and reasoning for its class.

For businesses, it suits cost-sensitive workloads such as structured content generation, SEO copy, and lightweight tool-calling pipelines. The very low pricing ($0.36 input, $0.40 output per 1M tokens) makes it attractive for high-volume automation. Agentic performance is limited, and coding capability is modest, so it is not suited to complex software engineering or multi-step agent tasks.

Teams looking for an affordable, capable model for content and structured output workflows will find it a practical option, particularly where budget constraints rule out frontier models.

Assessed June 30, 2026

Editorial notes

Qwen2.5 72B Instruct offers strong instruction following, tool use, and function calling at very competitive pricing, though its intelligence index places it well below current frontier models.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 131,072 tokens
Max output: 16,384 tokens
Tokenizer: Qwen
Released: Sep 19, 2024

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How Qwen2.5 72B Instruct compares

Qwen2.5 72B Instruct ranks #236 of 385 AI models we track for overall intelligence, #185 of 293 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.36 per million input tokens it is cheaper than 39% of comparable models.

About Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and..

72B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Type chatml

Performance Indices

Source: Artificial Analysis

9.6 Intelligence Index

19.5 Agentic Index

14 Math Index

Benchmark Scores

GPQA Diamond 49.1% Graduate-level scientific reasoning

HLE 4.2% Humanity's Last Exam

MMLU Pro 72% Multi-task language understanding

MATH 500 85.8% Mathematical problem-solving

AIME 16% Competition mathematics

AIME 2025 14% Competition mathematics (2025)

SciCode 26.7% Scientific computing

LiveCodeBench 27.6% Live coding evaluation

TerminalBench Hard 4.5% Agentic terminal tasks

τ²-Bench 34.5% Conversational agent benchmark

IFBench 36.9% Instruction following

LCR 20.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen2.5 72B Instruct stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen-2.5-72b-instruct`
Provider	qwen
Release Date	September 19, 2024
Context Length	131,072 tokens
Max Completion	16,384 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.36	$0.000360
Output	$0.40	$0.000400

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

91.9%

Avg Uptime

690ms

Best Latency (TTFT)

24 tok/s

Best Throughput

2/2

Active Endpoints

Available via: DeepInfra, Novita

Leaderboard Categories

SEO Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about Qwen2.5 72B Instruct

How much does Qwen2.5 72B Instruct cost?

Qwen2.5 72B Instruct costs $0.36 per million input tokens and $0.40 per million output tokens.

What is the context window of Qwen2.5 72B Instruct?

Qwen2.5 72B Instruct has a context window of 131,072 tokens (131K).

What can Qwen2.5 72B Instruct do?

Qwen2.5 72B Instruct supports tool use and function calling.

Who created Qwen2.5 72B Instruct?

Qwen2.5 72B Instruct is developed by Qwen and was released on September 19, 2024.

Qwen2.5 72B Instruct

Qwen2.5 72B Instruct

Analysis Summary

Performance Profile

How Qwen2.5 72B Instruct compares

About Qwen2.5 72B Instruct

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Frequently asked questions about Qwen2.5 72B Instruct

How much does Qwen2.5 72B Instruct cost?

What is the context window of Qwen2.5 72B Instruct?

What can Qwen2.5 72B Instruct do?

Who created Qwen2.5 72B Instruct?

Qwen2.5 72B Instruct

Performance Profile

How Qwen2.5 72B Instruct compares

About Qwen2.5 72B Instruct

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about Qwen2.5 72B Instruct

How much does Qwen2.5 72B Instruct cost?

What is the context window of Qwen2.5 72B Instruct?

What can Qwen2.5 72B Instruct do?

Who created Qwen2.5 72B Instruct?