Qwen: Qwen3.5-Flash

Qwen: Qwen3.5-Flash

qwen · Released Feb 25, 2026 Emerging
Awaiting
Review
Benchmarks pending

Performance Profile

Intelligence0Technical0Value8.3Content4
Intelligence 0/10
Technical 0/10
Content 4/10
Value 8.3/10

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.

$0.07 / 1M
Input Price
$0.26 / 1M
Output Price
1M tokens
Context Window
65,536 tokens
Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + Video → Text
TokenizerQwen3

Model Information

OpenRouter ID qwen/qwen3.5-flash-02-23
Providerqwen
Release Date February 25, 2026
Context Length1,000,000 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.07 $0.000065
Output $0.26 $0.000260

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
585ms
Best Latency (TTFT)
89 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba