Home > AI Models > Qwen: Qwen Plus 0728 (thinking)

Qwen: Qwen Plus 0728 (thinking)

Name: Qwen: Qwen Plus 0728 (thinking) Review
Item: Qwen: Qwen Plus 0728 (thinking)
Author: Design for Online Editorial

Qwen: Qwen Plus 0728 (thinking)

qwen · Released Sep 8, 2025 Legacy

Intelligence #316 / 583

28.0 Our Score

Speed

— Not reported

Input #317 / 583

$0.260 per 1M tokens

Output #297 / 583

$0.780 per 1M tokens

Context #52 / 583

1M tokens

Qwen Plus 0728 (thinking) is a reasoning-mode variant from Alibaba's Qwen family, offering a 1 million token context window with tool use and function calling. No benchmark data has been published for this specific variant, making it impossible to assess reasoning, coding, or instruction-following quality against the broader field.

The 1M context window is a standout feature that would be highly valuable for long-document analysis, large codebase review, or extended research workflows if the underlying model capability is strong. However, without benchmark evidence, adoption carries meaningful risk for client-facing or mission-critical applications.

This variant should be treated as experimental until independent results are available. Teams already using the Qwen Plus family may evaluate it for long-context use cases, but it cannot be recommended for primary business deployment at this stage.

Assessed June 6, 2026

Editorial notes

Qwen Plus 0728 (thinking) has no benchmark data available; the 1M token context window and tool use are notable features, but capability cannot be assessed without performance evidence.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 1M tokens
Max output: 32,768 tokens
Tokenizer: Qwen3
Released: Sep 8, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How Qwen: Qwen Plus 0728 (thinking) compares

Its 1M-token context window is larger than 91% of the models we list. At $0.26 per million input tokens it is cheaper than 46% of comparable models.

About Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

Capabilities

Tool Use Function Calling

How does Qwen: Qwen Plus 0728 (thinking) stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen-plus-2025-07-28:thinking`
Provider	qwen
Release Date	September 8, 2025
Context Length	1,000,000 tokens
Max Completion	32,768 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.26	$0.000260
Output	$0.78	$0.000780

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

387ms

Best Latency (TTFT)

76 tok/s

Best Throughput

0/1

Active Endpoints

Available via: Alibaba

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about Qwen: Qwen Plus 0728 (thinking)

How much does Qwen: Qwen Plus 0728 (thinking) cost?

Qwen: Qwen Plus 0728 (thinking) costs $0.26 per million input tokens and $0.78 per million output tokens.

What is the context window of Qwen: Qwen Plus 0728 (thinking)?

Qwen: Qwen Plus 0728 (thinking) has a context window of 1,000,000 tokens (1M).

What can Qwen: Qwen Plus 0728 (thinking) do?

Qwen: Qwen Plus 0728 (thinking) supports tool use and function calling.

Who created Qwen: Qwen Plus 0728 (thinking)?

Qwen: Qwen Plus 0728 (thinking) is developed by Qwen and was released on September 8, 2025.

Qwen: Qwen Plus 0728 (thinking)

Performance Profile

How Qwen: Qwen Plus 0728 (thinking) compares

About Qwen: Qwen Plus 0728 (thinking)

Capabilities

Model Information

Pricing

Live Performance

External Resources

Explore Related Models

Frequently asked questions about Qwen: Qwen Plus 0728 (thinking)

How much does Qwen: Qwen Plus 0728 (thinking) cost?

What is the context window of Qwen: Qwen Plus 0728 (thinking)?

What can Qwen: Qwen Plus 0728 (thinking) do?

Who created Qwen: Qwen Plus 0728 (thinking)?