Qwen3 4B (Reasoning)

Qwen3 4B (Reasoning)

Alibaba · Released Apr 28, 2025 Efficient
Intelligence #205 / 556
40.5 Our Score
Speed #113 / 257
103.0 tokens / sec
Input #215 / 557
$0.110 per 1M tokens
Output #336 / 557
$1.26 per 1M tokens
Context
Not reported

Analysis Summary

Qwen3 4B (Reasoning) sits in the Efficient tier on our leaderboard, ranked #205 of 556 published models on overall intelligence. At $0.110 input and $1.26 output per 1M tokens, it is among the most expensive on the market.

Editorial notes

Qwen3 4B Reasoning from Alibaba offers strong livecodebench scores for a sub-$0.15 model, but limited agentic and reasoning depth caps its usefulness for complex business tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.1Technical2.8Value7Content3.5
Intelligence 3.1/10
Technical 2.8/10
Content 3.5/10
Value 7/10

Performance Indices

Source: Artificial Analysis

14.2 Intelligence Index
19 Agentic Index
22.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 52.2% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 69.6% Multi-task language understanding
MATH 500 93.3% Mathematical problem-solving
AIME 65.7% Competition mathematics
AIME 2025 22.3% Competition mathematics (2025)
SciCode 3.5% Scientific computing

Technical

LiveCodeBench 46.5% Live coding evaluation
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 32.5% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen3 4B (Reasoning) stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

ProviderAlibaba
Release Date April 28, 2025
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.11 $0.000110
Output $1.26 $0.001260