Qwen: Qwen3.5-Flash

Qwen: Qwen3.5-Flash

qwen · Released Feb 25, 2026 New
40
Our Score

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.

$0.10 / 1M Input Price
$0.40 / 1M Output Price
1M tokens Context Window
65,536 tokens Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + Video → Text
TokenizerQwen3

Model Information

OpenRouter ID qwen/qwen3.5-flash-02-23
Providerqwen
Release Date February 25, 2026
Context Length1,000,000 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.40 $0.000400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
518ms
Best Latency (TTFT)
86 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba