Qwen: Qwen2.5 7B Instruct

Qwen: Qwen2.5 7B Instruct

qwen · Released Oct 16, 2024 Legacy
Intelligence #355 / 583
26.0 Our Score
Speed
— Not reported
Input #150 / 583
$0.040 per 1M tokens
Output #140 / 583
$0.100 per 1M tokens
Context #240 / 583
131,072 tokens

Analysis Summary

Qwen2.5 7B Instruct is Alibaba's 7B parameter instruction-tuned model, offering a 128K context window at very competitive pricing. No benchmark data is attached to this listing, so reasoning and coding capability cannot be verified directly, though the Qwen2.5 family generally performs well for its size class.

For businesses, the lack of tool use, function calling, or vision support limits its utility for agentic or multimodal workflows. It may serve well for lightweight summarisation, translation, or structured text tasks where cost efficiency is the priority.

At $0.04 input and $0.10 output per million tokens, it is among the most affordable options in the database. A provider accessibility penalty applies given limited enterprise adoption in enterprises. Teams comfortable with self-hosting or API access through third-party providers may find it a cost-effective option for high-volume, low-complexity tasks.

Assessed June 6, 2026

Editorial notes

Qwen2.5 7B Instruct is a compact Alibaba model with a 128K context window and very low pricing, but has no benchmark data in this listing and carries a provider accessibility penalty.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value8Content3.5
Intelligence 0/10
Technical 0/10
Content 3.5/10
Value 8/10

How Qwen: Qwen2.5 7B Instruct compares

Its 131K-token context window is larger than 59% of the models we list. At $0.04 per million input tokens it is cheaper than 74% of comparable models.

About Qwen: Qwen2.5 7B Instruct

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and..

7B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typechatml

How does Qwen: Qwen2.5 7B Instruct stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID qwen/qwen-2.5-7b-instruct
Providerqwen
Release Date October 16, 2024
Context Length131,072 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.10 $0.000100

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.8%
Avg Uptime
385ms
Best Latency (TTFT)
75 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Phala, Together

Leaderboard Categories

Frequently asked questions about Qwen: Qwen2.5 7B Instruct

How much does Qwen: Qwen2.5 7B Instruct cost?

Qwen: Qwen2.5 7B Instruct costs $0.04 per million input tokens and $0.10 per million output tokens.

What is the context window of Qwen: Qwen2.5 7B Instruct?

Qwen: Qwen2.5 7B Instruct has a context window of 131,072 tokens (131K).

What can Qwen: Qwen2.5 7B Instruct do?

Qwen: Qwen2.5 7B Instruct supports tool use and function calling.

Who created Qwen: Qwen2.5 7B Instruct?

Qwen: Qwen2.5 7B Instruct is developed by Qwen and was released on October 16, 2024.