Qwen3 VL 4B (Reasoning)

Qwen3 VL 4B (Reasoning)

Alibaba · Released Oct 14, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
AA Index #270 / 385
7.9 Artificial Analysis
Input
— Not priced
Output
— Not priced
Context
— Not reported

Analysis Summary

Qwen3 VL 4B Reasoning is the reasoning-mode variant of Alibaba's compact vision-language model, with an intelligence index of 7.9, GPQA of 0.494, and MMLU-Pro of 0.700. LiveCodeBench at 0.320 is a step up from the non-reasoning variant, and vision support remains a meaningful differentiator for multimodal tasks.

For businesses, it suits analytical visual tasks, structured content extraction from images, and moderate reasoning workflows where a small model footprint is valued. The agentic index drops to 8.5 compared to the non-reasoning variant's 23.4, which is a notable trade-off for teams considering agent pipelines. Long-context reliability (LCR 0.213) is modest.

A -4 point regional penalty applies. Teams needing a small multimodal model with stronger reasoning than the base variant will find this useful for analytical tasks, but the agentic trade-off means the non-reasoning variant may be preferable for tool-use workflows.

Assessed June 30, 2026

Editorial notes

Qwen3 VL 4B Reasoning from Alibaba adds a reasoning mode to the compact vision-language base, lifting GPQA to 0.494 and MMLU-Pro to 0.700, with multimodal support, though agentic capability drops versus the non-reasoning variant.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2Technical1.2Value0Content3.3
Intelligence 2/10
Technical 1.2/10
Content 3.3/10
Value 0/10

How Qwen3 VL 4B (Reasoning) compares

Qwen3 VL 4B (Reasoning) ranks #270 of 385 AI models we track for overall intelligence, #269 of 293 for agentic tasks. Qwen3 VL 4B (Reasoning) is currently free to use via OpenRouter.

Performance Indices

Source: Artificial Analysis

7.9 Intelligence Index
8.5 Agentic Index
25.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 49.4% Graduate-level scientific reasoning
HLE 4.4% Humanity's Last Exam
MMLU Pro 70% Multi-task language understanding
AIME 2025 25.7% Competition mathematics (2025)
SciCode 17.1% Scientific computing

Technical

LiveCodeBench 32% Live coding evaluation
TerminalBench Hard 1.5% Agentic terminal tasks
τ²-Bench 15.5% Conversational agent benchmark

Content

IFBench 36.6% Instruction following
LCR 21.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen3 VL 4B (Reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

ProviderAlibaba
Release Date October 14, 2025
Status Active

Leaderboard Categories

Frequently asked questions about Qwen3 VL 4B (Reasoning)

How much does Qwen3 VL 4B (Reasoning) cost?

Qwen3 VL 4B (Reasoning) is currently available for free via OpenRouter.

Who created Qwen3 VL 4B (Reasoning)?

Qwen3 VL 4B (Reasoning) is developed by Alibaba and was released on October 14, 2025.