Home > AI Models > Qwen3 VL 4B (Reasoning)

Qwen3 VL 4B (Reasoning)

Name: Qwen3 VL 4B (Reasoning) Review
Item: Qwen3 VL 4B (Reasoning)
Author: Design for Online Editorial

Qwen3 VL 4B (Reasoning)

Alibaba · Released Oct 14, 2025 Professional

Intelligence #14 / 590

82.0 Our Score

AA Index #270 / 385

7.9 Artificial Analysis

Input

— Not priced

Output

— Not priced

Context

— Not reported

Qwen3 VL 4B Reasoning is the reasoning-mode variant of Alibaba's compact vision-language model, with an intelligence index of 7.9, GPQA of 0.494, and MMLU-Pro of 0.700. LiveCodeBench at 0.320 is a step up from the non-reasoning variant, and vision support remains a meaningful differentiator for multimodal tasks.

For businesses, it suits analytical visual tasks, structured content extraction from images, and moderate reasoning workflows where a small model footprint is valued. The agentic index drops to 8.5 compared to the non-reasoning variant's 23.4, which is a notable trade-off for teams considering agent pipelines. Long-context reliability (LCR 0.213) is modest.

A -4 point regional penalty applies. Teams needing a small multimodal model with stronger reasoning than the base variant will find this useful for analytical tasks, but the agentic trade-off means the non-reasoning variant may be preferable for tool-use workflows.

Assessed June 30, 2026

Editorial notes

Qwen3 VL 4B Reasoning from Alibaba adds a reasoning mode to the compact vision-language base, lifting GPQA to 0.494 and MMLU-Pro to 0.700, with multimodal support, though agentic capability drops versus the non-reasoning variant.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

How Qwen3 VL 4B (Reasoning) compares

Qwen3 VL 4B (Reasoning) ranks #270 of 385 AI models we track for overall intelligence, #269 of 293 for agentic tasks. Qwen3 VL 4B (Reasoning) is currently free to use via OpenRouter.

Performance Indices

Source: Artificial Analysis

7.9 Intelligence Index

8.5 Agentic Index

25.7 Math Index

Benchmark Scores

GPQA Diamond 49.4% Graduate-level scientific reasoning

HLE 4.4% Humanity's Last Exam

MMLU Pro 70% Multi-task language understanding

AIME 2025 25.7% Competition mathematics (2025)

SciCode 17.1% Scientific computing

LiveCodeBench 32% Live coding evaluation

TerminalBench Hard 1.5% Agentic terminal tasks

τ²-Bench 15.5% Conversational agent benchmark

IFBench 36.6% Instruction following

LCR 21.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen3 VL 4B (Reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

Provider	Alibaba
Release Date	October 14, 2025
Status	Active

Leaderboard Categories

Content Writing Tool Use

Frequently asked questions about Qwen3 VL 4B (Reasoning)

How much does Qwen3 VL 4B (Reasoning) cost?

Qwen3 VL 4B (Reasoning) is currently available for free via OpenRouter.

Who created Qwen3 VL 4B (Reasoning)?

Qwen3 VL 4B (Reasoning) is developed by Alibaba and was released on October 14, 2025.

Qwen3 VL 4B (Reasoning)

Qwen3 VL 4B (Reasoning)

Analysis Summary

Performance Profile

How Qwen3 VL 4B (Reasoning) compares

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Leaderboard Categories

Frequently asked questions about Qwen3 VL 4B (Reasoning)

How much does Qwen3 VL 4B (Reasoning) cost?

Who created Qwen3 VL 4B (Reasoning)?

Qwen3 VL 4B (Reasoning)

Performance Profile

How Qwen3 VL 4B (Reasoning) compares

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Leaderboard Categories

Explore Related Models

Frequently asked questions about Qwen3 VL 4B (Reasoning)

How much does Qwen3 VL 4B (Reasoning) cost?

Who created Qwen3 VL 4B (Reasoning)?