Qwen: Qwen3.5-Flash
Analysis Summary
Qwen3.5-Flash is a lightweight model from Alibaba's Qwen team designed for high-throughput, cost-sensitive workloads. It supports text, image, and video inputs with tool use and function calling, and its 1M token context window is a practical advantage for large document processing. Pricing at $0.065/1M input is among the lowest in the Qwen family.
However, no benchmark data is available, so reasoning depth, coding capability, and instruction-following reliability cannot be assessed against the broader field. The capability flags suggest it could handle structured automation and multimodal tasks, but without measured performance, production deployment carries meaningful uncertainty.
At this price point, it may be worth piloting for bulk, lower-stakes tasks where cost is the primary driver. Businesses requiring validated performance should wait for benchmark data or use the benchmarked Qwen3.5-27B or 122B-A10B models for reliability-critical workflows. A -4 point regional accessibility adjustment applies.
Assessed June 6, 2026
Editorial notes
Qwen3.5-Flash supports vision, tool use, and function calling with a 1M token context window at very low cost, but has no benchmark data to validate its reasoning or task performance.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How Qwen: Qwen3.5-Flash compares
Its 1M-token context window is larger than 91% of the models we list. At $0.07 per million input tokens it is cheaper than 70% of comparable models.
About Qwen: Qwen3.5-Flash
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the..
Capabilities
How does Qwen: Qwen3.5-Flash stack up?
Compare side-by-side with other legacy models.
Model Information
| OpenRouter ID |
qwen/qwen3.5-flash-02-23
|
| Provider | qwen |
| Release Date | February 25, 2026 |
| Context Length | 1,000,000 tokens |
| Max Completion | 65,536 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.07 | $0.000065 |
| Output | $0.26 | $0.000260 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about Qwen: Qwen3.5-Flash
How much does Qwen: Qwen3.5-Flash cost?
Qwen: Qwen3.5-Flash costs $0.07 per million input tokens and $0.26 per million output tokens.
What is the context window of Qwen: Qwen3.5-Flash?
Qwen: Qwen3.5-Flash has a context window of 1,000,000 tokens (1M).
What can Qwen: Qwen3.5-Flash do?
Qwen: Qwen3.5-Flash supports image/vision input, tool use, and function calling.
Who created Qwen: Qwen3.5-Flash?
Qwen: Qwen3.5-Flash is developed by Qwen and was released on February 25, 2026.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 17, 2026 9:41 am