Qwen: Qwen3 VL 235B A22B Instruct

Qwen: Qwen3 VL 235B A22B Instruct

qwen · Released Sep 23, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #223 / 279
50.4 tokens / sec
Input #277 / 592
$0.200 per 1M tokens
Output #318 / 592
$0.880 per 1M tokens
Context #112 / 592
262,144 tokens

Analysis Summary

Qwen3 VL 235B A22B Instruct is Alibaba's large multimodal model, supporting text and image inputs with tool use and function calling across a 262K context window. Its intelligence index of 14.3 and agentic index of 21 indicate limited reasoning depth compared to frontier models, but MMLU Pro at 0.823 and GPQA at 0.712 show reasonable general knowledge coverage.

For businesses, this model suits cost-sensitive multimodal workflows: image analysis, document understanding, and structured content tasks where vision capability matters more than frontier reasoning. The low pricing ($0.20 input / $0.88 output) makes it viable for higher-volume use cases. Agentic reliability is limited, so complex multi-step tool use is not a strong fit.

A -4 point regional accessibility adjustment applies given the provider's limited enterprise footprint outside its home market. Teams needing a proven multimodal model with strong support should consider Google Gemini Flash variants; Qwen3 VL suits budget-conscious teams comfortable with self-evaluation.

Assessed June 30, 2026

Editorial notes

Qwen3 VL 235B A22B Instruct from Alibaba combines vision, tool use, and a 262K context at very competitive pricing, though reasoning and agentic benchmarks are modest.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.1Technical3.1Value8Content4.2
Intelligence 3.1/10
Technical 3.1/10
Content 4.2/10
Value 8/10

How Qwen: Qwen3 VL 235B A22B Instruct compares

Qwen: Qwen3 VL 235B A22B Instruct ranks #181 of 385 AI models we track for overall intelligence, #178 of 293 for agentic tasks. Its 262K-token context window is larger than 81% of the models we list. At $0.20 per million input tokens it is cheaper than 53% of comparable models.

About Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table..

235B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

14.3 Intelligence Index
21 Agentic Index
70.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 71.2% Graduate-level scientific reasoning
HLE 6.3% Humanity's Last Exam
MMLU Pro 82.3% Multi-task language understanding
AIME 2025 70.7% Competition mathematics (2025)
SciCode 35.9% Scientific computing

Technical

LiveCodeBench 59.4% Live coding evaluation
TerminalBench Hard 6.8% Agentic terminal tasks
τ²-Bench 35.1% Conversational agent benchmark

Content

IFBench 42.7% Instruction following
LCR 31.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 VL 235B A22B Instruct stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-vl-235b-a22b-instruct
Providerqwen
Release Date September 23, 2025
Context Length262,144 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.20 $0.000200
Output $0.88 $0.000880

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

92%
Avg Uptime
986ms
Best Latency (TTFT)
36 tok/s
Best Throughput
5/5
Active Endpoints
Available via: DeepInfra, Venice, Parasail, Alibaba, Novita

Leaderboard Categories

Frequently asked questions about Qwen: Qwen3 VL 235B A22B Instruct

How much does Qwen: Qwen3 VL 235B A22B Instruct cost?

Qwen: Qwen3 VL 235B A22B Instruct costs $0.20 per million input tokens and $0.88 per million output tokens.

What is the context window of Qwen: Qwen3 VL 235B A22B Instruct?

Qwen: Qwen3 VL 235B A22B Instruct has a context window of 262,144 tokens (262K).

What can Qwen: Qwen3 VL 235B A22B Instruct do?

Qwen: Qwen3 VL 235B A22B Instruct supports image/vision input, tool use, and function calling.

Who created Qwen: Qwen3 VL 235B A22B Instruct?

Qwen: Qwen3 VL 235B A22B Instruct is developed by Qwen and was released on September 23, 2025.