Qwen: Qwen3 VL 235B A22B Instruct
Analysis Summary
Qwen3 VL 235B A22B Instruct is Alibaba's large multimodal model, supporting text and image inputs with tool use and function calling across a 262K context window. Its intelligence index of 14.3 and agentic index of 21 indicate limited reasoning depth compared to frontier models, but MMLU Pro at 0.823 and GPQA at 0.712 show reasonable general knowledge coverage.
For businesses, this model suits cost-sensitive multimodal workflows: image analysis, document understanding, and structured content tasks where vision capability matters more than frontier reasoning. The low pricing ($0.20 input / $0.88 output) makes it viable for higher-volume use cases. Agentic reliability is limited, so complex multi-step tool use is not a strong fit.
A -4 point regional accessibility adjustment applies given the provider's limited enterprise footprint outside its home market. Teams needing a proven multimodal model with strong support should consider Google Gemini Flash variants; Qwen3 VL suits budget-conscious teams comfortable with self-evaluation.
Assessed June 30, 2026
Editorial notes
Qwen3 VL 235B A22B Instruct from Alibaba combines vision, tool use, and a 262K context at very competitive pricing, though reasoning and agentic benchmarks are modest.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How Qwen: Qwen3 VL 235B A22B Instruct compares
Qwen: Qwen3 VL 235B A22B Instruct ranks #181 of 385 AI models we track for overall intelligence, #178 of 293 for agentic tasks. Its 262K-token context window is larger than 81% of the models we list. At $0.20 per million input tokens it is cheaper than 53% of comparable models.
About Qwen: Qwen3 VL 235B A22B Instruct
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table..
Capabilities
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Qwen: Qwen3 VL 235B A22B Instruct stack up?
Compare side-by-side with other professional models.
Model Information
| OpenRouter ID |
qwen/qwen3-vl-235b-a22b-instruct
|
| Provider | qwen |
| Release Date | September 23, 2025 |
| Context Length | 262,144 tokens |
| Max Completion | 16,384 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.20 | $0.000200 |
| Output | $0.88 | $0.000880 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
Leaderboard Categories
External Resources
Explore Related Models
Frequently asked questions about Qwen: Qwen3 VL 235B A22B Instruct
How much does Qwen: Qwen3 VL 235B A22B Instruct cost?
Qwen: Qwen3 VL 235B A22B Instruct costs $0.20 per million input tokens and $0.88 per million output tokens.
What is the context window of Qwen: Qwen3 VL 235B A22B Instruct?
Qwen: Qwen3 VL 235B A22B Instruct has a context window of 262,144 tokens (262K).
What can Qwen: Qwen3 VL 235B A22B Instruct do?
Qwen: Qwen3 VL 235B A22B Instruct supports image/vision input, tool use, and function calling.
Who created Qwen: Qwen3 VL 235B A22B Instruct?
Qwen: Qwen3 VL 235B A22B Instruct is developed by Qwen and was released on September 23, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: July 2, 2026 8:38 pm