Qwen: Qwen Plus 0728 (thinking)

Qwen: Qwen Plus 0728 (thinking)

qwen · Released Sep 8, 2025 Legacy
Intelligence #398 / 576
28.0 Our Score
Speed
— Not reported
Input #314 / 576
$0.260 per 1M tokens
Output #295 / 576
$0.780 per 1M tokens
Context #51 / 576
1M tokens

Analysis Summary

Qwen Plus 0728 (thinking) is a reasoning-mode variant from Alibaba's Qwen family, offering a 1 million token context window with tool use and function calling. No benchmark data has been published for this specific variant, making it impossible to assess reasoning, coding, or instruction-following quality against the broader field.

The 1M context window is a standout feature that would be highly valuable for long-document analysis, large codebase review, or extended research workflows if the underlying model capability is strong. However, without benchmark evidence, adoption carries meaningful risk for client-facing or mission-critical applications.

This variant should be treated as experimental until independent results are available. Teams already using the Qwen Plus family may evaluate it for long-context use cases, but it cannot be recommended for primary business deployment at this stage.

Assessed June 6, 2026

Editorial notes

Qwen Plus 0728 (thinking) has no benchmark data available; the 1M token context window and tool use are notable features, but capability cannot be assessed without performance evidence.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value8.3Content2.5
Intelligence 0/10
Technical 0/10
Content 2.5/10
Value 8.3/10

How Qwen: Qwen Plus 0728 (thinking) compares

Its 1M-token context window is larger than 91% of the models we list. At $0.26 per million input tokens it is cheaper than 45% of comparable models.

About Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

Capabilities

Tool Use Function Calling

How does Qwen: Qwen Plus 0728 (thinking) stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID qwen/qwen-plus-2025-07-28:thinking
Providerqwen
Release Date September 8, 2025
Context Length1,000,000 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.26 $0.000260
Output $0.78 $0.000780

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

357ms
Best Latency (TTFT)
82 tok/s
Best Throughput
0/1
Active Endpoints
Available via: Alibaba

Frequently asked questions about Qwen: Qwen Plus 0728 (thinking)

How much does Qwen: Qwen Plus 0728 (thinking) cost?

Qwen: Qwen Plus 0728 (thinking) costs $0.26 per million input tokens and $0.78 per million output tokens.

What is the context window of Qwen: Qwen Plus 0728 (thinking)?

Qwen: Qwen Plus 0728 (thinking) has a context window of 1,000,000 tokens (1M).

What can Qwen: Qwen Plus 0728 (thinking) do?

Qwen: Qwen Plus 0728 (thinking) supports tool use and function calling.

Who created Qwen: Qwen Plus 0728 (thinking)?

Qwen: Qwen Plus 0728 (thinking) is developed by Qwen and was released on September 8, 2025.