Qwen: Qwen3.5-Flash
Analysis Summary
Qwen: Qwen3.5-Flash sits in the Emerging tier on our leaderboard, ranked #300 of 544 published models on overall intelligence. At $0.065 input and $0.260 output per 1M tokens, it is among the least expensive on the market. It offers an exceptionally large 1,000,000-token context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.
Editorial notes
Qwen3.5-Flash supports vision and video input with a 1M-token context window at very low pricing, but no benchmark data is available to assess its reasoning or coding capability.
Assessed May 5, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.
Performance Profile
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the..
Model Information
| OpenRouter ID | qwen/qwen3.5-flash-02-23 |
| Provider | qwen |
| Release Date | February 25, 2026 |
| Context Length | 1,000,000 tokens |
| Max Completion | 65,536 tokens |
| Status | Active |
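The two limits in the table interact when sizing a request. The following is a minimal sketch, assuming (this page does not confirm it) that prompt and completion tokens share the 1,000,000-token context window; the function name `completion_budget` is hypothetical and used only for illustration.

```python
# Limits taken from the Model Information table.
CONTEXT_LENGTH = 1_000_000   # total context window, in tokens
MAX_COMPLETION = 65_536      # maximum completion length, in tokens


def completion_budget(prompt_tokens: int, requested: int = MAX_COMPLETION) -> int:
    """Return how many completion tokens can be requested for a given prompt.

    Assumption: prompt and completion tokens are counted against the same
    context window. The result is capped by the requested amount, the
    model's maximum completion length, and the space left after the prompt.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    if remaining <= 0:
        return 0  # the prompt alone fills (or overflows) the window
    return min(requested, MAX_COMPLETION, remaining)
```

Under this assumption, a 900,000-token prompt still leaves room for the full 65,536-token completion, while a 990,000-token prompt leaves only 10,000 tokens.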
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.065 | $0.000065 |
| Output | $0.26 | $0.000260 |
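As a rough illustration of how the listed rates translate into a per-request cost, here is a small sketch. The rates come from the table above; the function name `estimate_cost_usd` is hypothetical, and the sketch assumes flat per-token billing with no caching discounts, tiered pricing, or provider-specific surcharges.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (from the Pricing table).
INPUT_USD_PER_M = Decimal("0.065")
OUTPUT_USD_PER_M = Decimal("0.260")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD at the listed flat rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


# A request that fills the 1M-token context and the 65,536-token completion cap.
print(estimate_cost_usd(1_000_000, 65_536))
```

`Decimal` is used instead of floats so that very small per-token amounts are not distorted by binary rounding. At these rates, even a request that uses the full context window and the maximum completion length costs a little over eight cents.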
Data sourced from the OpenRouter API, Artificial Analysis, and the Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 5, 2026 11:06 am