Qwen: Qwen3.5-Flash

Qwen: Qwen3.5-Flash

qwen · Released Feb 25, 2026 Legacy
Intelligence #397 / 579
28.9 Our Score
Speed
— Not reported
Input #171 / 579
$0.065 per 1M tokens
Output #202 / 579
$0.260 per 1M tokens
Context #52 / 579
1M tokens

Analysis Summary

Qwen3.5-Flash is a lightweight model from Alibaba's Qwen team designed for high-throughput, cost-sensitive workloads. It supports text, image, and video inputs with tool use and function calling, and its 1M token context window is a practical advantage for large document processing. Pricing at $0.065/1M input is among the lowest in the Qwen family.

However, no benchmark data is available, so reasoning depth, coding capability, and instruction-following reliability cannot be assessed against the broader field. The capability flags suggest it could handle structured automation and multimodal tasks, but without measured performance, production deployment carries meaningful uncertainty.

At this price point, it may be worth piloting for bulk, lower-stakes tasks where cost is the primary driver. Businesses requiring validated performance should wait for benchmark data or use the benchmarked Qwen3.5-27B or 122B-A10B models for reliability-critical workflows. A -4 point regional accessibility adjustment applies.

Assessed June 6, 2026

Editorial notes

Qwen3.5-Flash supports vision, tool use, and function calling with a 1M token context window at very low cost, but has no benchmark data to validate its reasoning or task performance.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value8.3Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 8.3/10

How Qwen: Qwen3.5-Flash compares

Its 1M-token context window is larger than 91% of the models we list. At $0.07 per million input tokens it is cheaper than 70% of comparable models.

About Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the..

Capabilities

Tool Use Function Calling Vision

How does Qwen: Qwen3.5-Flash stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3.5-flash-02-23
Providerqwen
Release Date February 25, 2026
Context Length1,000,000 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.07 $0.000065
Output $0.26 $0.000260

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
639ms
Best Latency (TTFT)
74.5 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba

Frequently asked questions about Qwen: Qwen3.5-Flash

How much does Qwen: Qwen3.5-Flash cost?

Qwen: Qwen3.5-Flash costs $0.07 per million input tokens and $0.26 per million output tokens.

What is the context window of Qwen: Qwen3.5-Flash?

Qwen: Qwen3.5-Flash has a context window of 1,000,000 tokens (1M).

What can Qwen: Qwen3.5-Flash do?

Qwen: Qwen3.5-Flash supports image/vision input, tool use, and function calling.

Who created Qwen: Qwen3.5-Flash?

Qwen: Qwen3.5-Flash is developed by Qwen and was released on February 25, 2026.