Qwen: Qwen3 VL 32B Instruct

qwen · Released Oct 23, 2025 · Efficient tier
Intelligence #149 / 557
49.6 Our Score
Speed #171 / 259
62.8 tokens / sec
Input #217 / 560
$0.104 per 1M tokens
Output #239 / 560
$0.416 per 1M tokens
Context #99 / 560
262,144 tokens

Analysis Summary

Qwen: Qwen3 VL 32B Instruct sits in the Efficient tier on our leaderboard, ranked #149 of 557 published models on overall intelligence. At $0.104 input and $0.416 output per 1M tokens, it ranks #217 and #239 of 560 models on input and output price, placing it mid-pack on cost rather than among the most expensive on the market. Its 262,144-token context window (#99 of 560) leaves room for long documents and code review, and it supports tool use, function calling, and vision.

Editorial notes

Qwen3 VL 32B Instruct combines vision, tool use, function calling, and a 262K context with the strongest coding index in the Qwen3 VL family at competitive pricing. Its Intelligence Index (17.2) remains limited.

Assessed May 17, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 3.8/10
Technical 3.1/10
Content 4.5/10
Value 8/10

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text..

32B Parameters

Capabilities

Tool Use, Function Calling, Vision

Performance Indices

Source: Artificial Analysis

17.2 Intelligence Index
15.6 Coding Index
18.8 Agentic Index
68.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 67.1% Graduate-level scientific reasoning
HLE 6.3% Humanity's Last Exam
MMLU Pro 79.1% Multi-task language understanding
AIME 2025 68.3% Competition mathematics (2025)
SciCode 30.1% Scientific computing

Technical

LiveCodeBench 51.4% Live coding evaluation
TerminalBench Hard 8.3% Agentic terminal tasks
τ²-Bench 29.2% Conversational agent benchmark

Content

IFBench 39.2% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3-vl-32b-instruct
Provider qwen
Release Date October 23, 2025
Context Length 262,144 tokens
Max Completion 32,768 tokens
Status Active
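
The two limits above interact when sizing a request. The sketch below shows one way to work out how much prompt fits, assuming (this is an assumption, not something the listing states) that prompt and completion tokens share the same 262,144-token context window. The function name `max_prompt_tokens` is hypothetical.

```python
# Limits copied from the Model Information table.
CONTEXT_LENGTH = 262_144   # total context window, in tokens
MAX_COMPLETION = 32_768    # largest completion the model will return, in tokens


def max_prompt_tokens(completion_tokens: int = MAX_COMPLETION) -> int:
    """Return the prompt budget left after reserving room for the completion.

    Assumes prompt and completion share one context window, which is a
    common convention but is not confirmed by the listing.
    """
    if not 0 <= completion_tokens <= MAX_COMPLETION:
        raise ValueError(f"completion_tokens must be between 0 and {MAX_COMPLETION}")
    return CONTEXT_LENGTH - completion_tokens


# Reserving the full completion budget leaves 229,376 tokens for the prompt.
print(max_prompt_tokens())
```

Reserving a smaller completion budget frees a correspondingly larger share of the window for the prompt.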

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $0.104                $0.000104
Output        $0.416                $0.000416
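
As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed per-1M prices. The function name and the example token counts are hypothetical, and the figure ignores any provider-specific fees or discounts.

```python
# Rates copied from the pricing table (USD per 1M tokens).
INPUT_USD_PER_M = 0.104
OUTPUT_USD_PER_M = 0.416


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 1,000 completion tokens.
print(f"${estimate_cost_usd(10_000, 1_000):.6f}")  # prints $0.001456
```

Output tokens cost four times as much as input tokens here, so long completions dominate the bill sooner than long prompts do.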

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 100%
Best Latency (TTFT): 613ms
Best Throughput: 61 tok/s
Active Endpoints: 1/1
Available via: Alibaba
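
For a back-of-the-envelope sense of response time, the live figures can be combined as time to first token plus generation time at a steady throughput. This is a simplified model under stated assumptions: it treats the best-case latency and throughput as constant, which real endpoints will not guarantee, and the function name is hypothetical.

```python
# Best-case live metrics copied from the listing.
TTFT_SECONDS = 0.613          # best time to first token (613 ms)
THROUGHPUT_TOK_PER_S = 61.0   # best throughput


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate wall-clock seconds to stream a completion of the given length.

    Simplified model: fixed time to first token, then tokens arrive at a
    constant rate. Actual latency varies with load and prompt size.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOK_PER_S


# Hypothetical 1,000-token completion: roughly 17 seconds under this model.
print(f"{estimate_response_seconds(1_000):.1f} s")
```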

Leaderboard Categories