Qwen: Qwen2.5 7B Instruct

Qwen: Qwen2.5 7B Instruct

qwen · Released Oct 16, 2024
27
Our Score

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. - Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g, tables), and generating structured outputs especially JSON. More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots. - Long-context Support up to 128K tokens and can generate up to 8K tokens. - Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more. Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

$0.04 / 1M Input Price
$0.10 / 1M Output Price
32,768 tokens Context Window
7B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen
Instruct Typechatml
Parameters7B

Model Information

OpenRouter ID qwen/qwen-2.5-7b-instruct
Providerqwen
Release Date October 16, 2024
Context Length32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.10 $0.000100

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9%
Avg Uptime
399ms
Best Latency (TTFT)
42 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Phala, AtlasCloud, Together