Qwen3.5 4B (Reasoning)

Alibaba · Released Mar 2, 2026 · Specialist
Intelligence #135 / 556
52.2 Our Score
Speed #36 / 257
201.3 tokens / sec
Input #130 / 557
$0.030 per 1M tokens
Output #154 / 557
$0.150 per 1M tokens
Context
Not reported

Analysis Summary

Qwen3.5 4B (Reasoning) sits in the Specialist tier on our leaderboard, ranked #135 of 556 published models on overall intelligence. At $0.030 input and $0.150 output per 1M tokens, it ranks #130 of 557 on input price and #154 of 557 on output price, and its 7.3/10 Value score is the strongest part of its profile.

Editorial notes

Qwen3.5 4B (Reasoning) offers a strong agentic index and good instruction-following at just $0.03 per 1M input tokens, making it a cost-effective option for structured content tasks despite limited overall intelligence.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.

Performance Profile

Intelligence 4.8/10
Technical 4.3/10
Content 4.5/10
Value 7.3/10

Performance Indices

Source: Artificial Analysis

27.1 Intelligence Index
17.5 Coding Index
55.1 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 77.1% Graduate-level scientific reasoning
HLE 7.8% Humanity's Last Exam
SciCode 16.1% Scientific computing

Technical

TerminalBench Hard 18.2% Agentic terminal tasks
τ²-Bench 92.1% Conversational agent benchmark

Content

IFBench 52% Instruction following
LCR 55.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

Provider Alibaba
Release Date March 2, 2026
Status Active

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $0.03                 $0.000030
Output        $0.15                 $0.000150
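
To make the pricing arithmetic concrete, here is a minimal sketch that converts the listed per-1M rates to per-1K rates and estimates the cost of a single request. The function names and the example token counts (10,000 input, 2,000 output) are illustrative assumptions, not figures from the leaderboard.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (from the pricing table above).
INPUT_USD_PER_1M = Decimal("0.03")
OUTPUT_USD_PER_1M = Decimal("0.15")


def per_1k(rate_per_1m: Decimal) -> Decimal:
    """Convert a per-1M-token rate to a per-1K-token rate."""
    return rate_per_1m / Decimal(1000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed rates."""
    total = (Decimal(input_tokens) * INPUT_USD_PER_1M
             + Decimal(output_tokens) * OUTPUT_USD_PER_1M)
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    print("Input per 1K: ", per_1k(INPUT_USD_PER_1M))
    print("Output per 1K:", per_1k(OUTPUT_USD_PER_1M))
    # Hypothetical request: 10,000 input tokens and 2,000 output tokens.
    print("Request cost: ", request_cost(10_000, 2_000))
```

Because output costs five times as much per token as input, the 2,000 output tokens in this hypothetical request cost the same as the 10,000 input tokens.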

Leaderboard Categories

SEO