Qwen: Qwen3.5-Flash
Analysis Summary
Qwen: Qwen3.5-Flash sits in the Emerging tier on our leaderboard, ranked #293 of 561 published models on overall intelligence. At $0.065 input and $0.260 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.
Editorial notes
Qwen3.5-Flash offers a 1M token context with vision and tool use at low pricing, but has no benchmark data to validate capability at this stage.
Assessed May 14, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the..
Capabilities
How does Qwen: Qwen3.5-Flash stack up?
Compare side-by-side with other emerging models.
Model Information
| OpenRouter ID |
qwen/qwen3.5-flash-02-23
|
| Provider | qwen |
| Release Date | February 25, 2026 |
| Context Length | 1,000,000 tokens |
| Max Completion | 65,536 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.07 | $0.000065 |
| Output | $0.26 | $0.000260 |
Live Performance
Live endpoint metrics — refreshed every 30 minutes.
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 23, 2026 8:38 pm