Qwen: Qwen2.5 7B Instruct

Qwen: Qwen2.5 7B Instruct

qwen · Released Oct 16, 2024 Legacy
Awaiting
Review
Benchmarks pending

Performance Profile

Intelligence0Technical0Value7.8Content4
Intelligence 0/10
Technical 0/10
Content 4/10
Value 7.8/10

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. - Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g, tables), and generating structured outputs especially JSON. More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots. - Long-context Support up to 128K tokens and can generate up to 8K tokens. - Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more. Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

$0.04 / 1M
Input Price
$0.10 / 1M
Output Price
32,768 tokens
Context Window
32,768 tokens
Max Output
7B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen
Instruct Typechatml
Parameters7B

Model Information

OpenRouter ID qwen/qwen-2.5-7b-instruct
Providerqwen
Release Date October 16, 2024
Context Length32,768 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.10 $0.000100

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.4%
Avg Uptime
211ms
Best Latency (TTFT)
25 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Phala, AtlasCloud, Together

Leaderboard Categories