Qwen: Qwen3.5-Flash

Qwen: Qwen3.5-Flash

qwen · Released Feb 25, 2026 Emerging
Intelligence #300 / 544
28.0 Our Score
Speed
Not reported
Input #159 / 544
$0.065 per 1M tokens
Output #189 / 544
$0.260 per 1M tokens
Context #43 / 544
1M tokens

Analysis Summary

Qwen: Qwen3.5-Flash sits in the Emerging tier on our leaderboard, ranked #300 of 544 published models on overall intelligence. At $0.065 input and $0.260 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Qwen3.5-Flash supports vision and video with a 1M token context window at very low pricing, but has no benchmark data to assess reasoning or coding capability.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value8.3Content2.5
Intelligence 0/10
Technical 0/10
Content 2.5/10
Value 8.3/10

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the..

Capabilities

Tool Use Function Calling Vision

How does Qwen: Qwen3.5-Flash stack up?

Compare side-by-side with other emerging models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3.5-flash-02-23
Providerqwen
Release Date February 25, 2026
Context Length1,000,000 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.07 $0.000065
Output $0.26 $0.000260

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
542ms
Best Latency (TTFT)
81 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba