Qwen: Qwen3 VL 32B Instruct

Qwen: Qwen3 VL 32B Instruct

qwen · Released Oct 23, 2025 Professional
Intelligence #10 / 576
82.0 Our Score
Speed #165 / 271
75.4 tokens / sec
Input #222 / 577
$0.104 per 1M tokens
Output #246 / 577
$0.416 per 1M tokens
Context #108 / 577
262,144 tokens

Analysis Summary

Qwen3 VL 32B Instruct is Alibaba's larger vision-language model in the Qwen3 VL family, supporting text and image inputs with tool use and function calling across a 262K context window. Its intelligence index of 17.2 and coding index of 15.6 are modest in absolute terms but represent a meaningful step up from the 8B variants, with stronger math and livecodebench scores.

For businesses, it is suited to document analysis, image-based structured extraction, and content workflows where visual understanding and moderate reasoning are both required. The 262K context window supports long-document use cases. Agentic performance is limited, so it is not well suited to fully autonomous multi-step pipelines.

At $0.104 input and $0.416 output per million tokens, it is competitively priced for a multimodal model with this context capacity. Teams needing a cost-effective vision-capable model for structured content and analysis tasks will find it a practical choice, though those requiring stronger reasoning should consider larger alternatives.

Assessed June 6, 2026

Editorial notes

Qwen3 VL 32B Instruct is a capable mid-size vision-language model with tool use, function calling, and a 262K context window, offering stronger coding and agentic performance than the 8B variants at a competitive price.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.3Technical2.8Value8Content4
Intelligence 3.3/10
Technical 2.8/10
Content 4/10
Value 8/10

How Qwen: Qwen3 VL 32B Instruct compares

Qwen: Qwen3 VL 32B Instruct ranks #202 of 378 AI models we track for overall intelligence, #174 of 315 for coding, #187 of 289 for agentic tasks. Its 262K-token context window is larger than 81% of the models we list. At $0.10 per million input tokens it is cheaper than 62% of comparable models.

About Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text..

32B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

17.2 Intelligence Index
15.6 Coding Index
18.8 Agentic Index
68.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 67.1% Graduate-level scientific reasoning
HLE 6.3% Humanity's Last Exam
MMLU Pro 79.1% Multi-task language understanding
AIME 2025 68.3% Competition mathematics (2025)
SciCode 30.1% Scientific computing

Technical

LiveCodeBench 51.4% Live coding evaluation
TerminalBench Hard 8.3% Agentic terminal tasks
τ²-Bench 29.2% Conversational agent benchmark

Content

IFBench 39.2% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 VL 32B Instruct stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-vl-32b-instruct
Providerqwen
Release Date October 23, 2025
Context Length262,144 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000104
Output $0.42 $0.000416

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
515ms
Best Latency (TTFT)
52 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Alibaba

Leaderboard Categories

Frequently asked questions about Qwen: Qwen3 VL 32B Instruct

How much does Qwen: Qwen3 VL 32B Instruct cost?

Qwen: Qwen3 VL 32B Instruct costs $0.10 per million input tokens and $0.42 per million output tokens.

What is the context window of Qwen: Qwen3 VL 32B Instruct?

Qwen: Qwen3 VL 32B Instruct has a context window of 262,144 tokens (262K).

Is Qwen: Qwen3 VL 32B Instruct good for coding?

On our coding benchmark index, Qwen: Qwen3 VL 32B Instruct ranks #174 of 315 models, placing it in the broader range of the field for code generation and debugging.

What can Qwen: Qwen3 VL 32B Instruct do?

Qwen: Qwen3 VL 32B Instruct supports image/vision input, tool use, and function calling.

Who created Qwen: Qwen3 VL 32B Instruct?

Qwen: Qwen3 VL 32B Instruct is developed by Qwen and was released on October 23, 2025.