Qwen3.5 4B (Reasoning)

Qwen3.5 4B (Reasoning)

Alibaba · Released Mar 2, 2026 Specialist
Intelligence #129 / 551
52.8 Our Score
Speed #34 / 257
189.6 tokens / sec
Input #128 / 552
$0.030 per 1M tokens
Output #152 / 552
$0.150 per 1M tokens
Context
Not reported

Analysis Summary

Qwen3.5 4B (Reasoning) sits in the Specialist tier on our leaderboard, ranked #129 of 551 published models on overall intelligence. At $0.030 input and $0.150 output per 1M tokens, it is among the most expensive on the market.

Editorial notes

Qwen3.5 4B (Reasoning) from Alibaba punches above its size on math and agentic tasks at very low cost, but reasoning and coding indices remain limited for serious business workflows.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.8Technical4.3Value7.3Content4.8
Intelligence 4.8/10
Technical 4.3/10
Content 4.8/10
Value 7.3/10

Performance Indices

Source: Artificial Analysis

27.1 Intelligence Index
17.5 Coding Index
55.2 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 77.1% Graduate-level scientific reasoning
HLE 7.8% Humanity's Last Exam
SciCode 16.1% Scientific computing

Technical

TerminalBench Hard 18.2% Agentic terminal tasks
τ²-Bench 92.1% Conversational agent benchmark

Content

IFBench 52% Instruction following
LCR 55.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen3.5 4B (Reasoning) stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

ProviderAlibaba
Release Date March 2, 2026
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.03 $0.000030
Output $0.15 $0.000150

Leaderboard Categories