Home > AI Models > Qwen: Qwen2.5 7B Instruct

Qwen: Qwen2.5 7B Instruct

Name: Qwen: Qwen2.5 7B Instruct Review
Item: Qwen: Qwen2.5 7B Instruct
Author: Design for Online Editorial

Qwen: Qwen2.5 7B Instruct

qwen · Released Oct 16, 2024 Legacy

Intelligence #355 / 583

26.0 Our Score

Speed

— Not reported

Input #150 / 583

$0.040 per 1M tokens

Output #140 / 583

$0.100 per 1M tokens

Context #240 / 583

131,072 tokens

Qwen2.5 7B Instruct is Alibaba's 7B parameter instruction-tuned model, offering a 128K context window at very competitive pricing. No benchmark data is attached to this listing, so reasoning and coding capability cannot be verified directly, though the Qwen2.5 family generally performs well for its size class.

For businesses, the lack of tool use, function calling, or vision support limits its utility for agentic or multimodal workflows. It may serve well for lightweight summarisation, translation, or structured text tasks where cost efficiency is the priority.

At $0.04 input and $0.10 output per million tokens, it is among the most affordable options in the database. A provider accessibility penalty applies given limited enterprise adoption in enterprises. Teams comfortable with self-hosting or API access through third-party providers may find it a cost-effective option for high-volume, low-complexity tasks.

Assessed June 6, 2026

Editorial notes

Qwen2.5 7B Instruct is a compact Alibaba model with a 128K context window and very low pricing, but has no benchmark data in this listing and carries a provider accessibility penalty.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 131,072 tokens
Max output: 32,768 tokens
Tokenizer: Qwen
Released: Oct 16, 2024

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How Qwen: Qwen2.5 7B Instruct compares

Its 131K-token context window is larger than 59% of the models we list. At $0.04 per million input tokens it is cheaper than 74% of comparable models.

About Qwen: Qwen2.5 7B Instruct

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and..

7B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Type chatml

How does Qwen: Qwen2.5 7B Instruct stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen-2.5-7b-instruct`
Provider	qwen
Release Date	October 16, 2024
Context Length	131,072 tokens
Max Completion	32,768 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.04	$0.000040
Output	$0.10	$0.000100

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.8%

Avg Uptime

385ms

Best Latency (TTFT)

75 tok/s

Best Throughput

2/2

Active Endpoints

Available via: Phala, Together

Leaderboard Categories

Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about Qwen: Qwen2.5 7B Instruct

How much does Qwen: Qwen2.5 7B Instruct cost?

Qwen: Qwen2.5 7B Instruct costs $0.04 per million input tokens and $0.10 per million output tokens.

What is the context window of Qwen: Qwen2.5 7B Instruct?

Qwen: Qwen2.5 7B Instruct has a context window of 131,072 tokens (131K).

What can Qwen: Qwen2.5 7B Instruct do?

Qwen: Qwen2.5 7B Instruct supports tool use and function calling.

Who created Qwen: Qwen2.5 7B Instruct?

Qwen: Qwen2.5 7B Instruct is developed by Qwen and was released on October 16, 2024.

Qwen: Qwen2.5 7B Instruct

Performance Profile

How Qwen: Qwen2.5 7B Instruct compares

About Qwen: Qwen2.5 7B Instruct

Capabilities

Architecture Detail

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about Qwen: Qwen2.5 7B Instruct

How much does Qwen: Qwen2.5 7B Instruct cost?

What is the context window of Qwen: Qwen2.5 7B Instruct?

What can Qwen: Qwen2.5 7B Instruct do?

Who created Qwen: Qwen2.5 7B Instruct?