Qwen3 VL 4B Instruct

Alibaba · Released Oct 14, 2025 · Emerging
Intelligence #473 / 556
17.2 Our Score
AA Index #309 / 365
9.6 Artificial Analysis
Input Not priced
Output Not priced
Context Not reported

Analysis Summary

Qwen3 VL 4B Instruct sits in the Emerging tier on our leaderboard, ranked #473 of 556 published models on overall intelligence. Input and output pricing is not reported, so the model cannot be compared with others on cost.

Editorial notes

Qwen3 VL 4B Instruct is a compact vision-language model with multimodal capability but limited reasoning and coding scores, suited to lightweight vision tasks rather than complex business workflows.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.

Performance Profile

Intelligence 2.3/10
Technical 1.9/10
Content 3/10
Value 0/10

Performance Indices

Source: Artificial Analysis

9.6 Intelligence Index
4.6 Coding Index
23.4 Agentic Index
37 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 37.1% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 63.4% Multi-task language understanding
AIME 2025 37% Competition mathematics (2025)
SciCode 13.7% Scientific computing

Technical

LiveCodeBench 29% Live coding evaluation
τ²-Bench 23.4% Conversational agent benchmark

Content

IFBench 31.8% Instruction following
LCR 13% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

Provider Alibaba
Release Date October 14, 2025
Status Active