Qwen3.5 4B (Reasoning)

Qwen3.5 4B (Reasoning)

Alibaba · Released Mar 2, 2026 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #279 / 279
21.7 tokens / sec
Input #141 / 592
$0.030 per 1M tokens
Output #164 / 592
$0.150 per 1M tokens
Context
— Not reported

Analysis Summary

Qwen3.5 4B (Reasoning) is a small-footprint model from Alibaba with an intelligence index of 20.1, placing it well below frontier tier but competitive for its parameter count. Its agentic index of 55.1 is a standout for a 4B model, suggesting reasonable multi-step task handling. Coding capability is limited at 22.6, and reasoning depth on hard benchmarks is modest.

For businesses, this model fits high-volume, low-complexity automation: classification, routing, summarisation of short documents, and lightweight agentic pipelines where cost per call matters more than answer quality. It is not suited to complex reasoning, code generation, or client-facing content where accuracy is critical.

At $0.03/1M input and $0.15/1M output, it is one of the cheapest options available. Teams running large-scale inference on simple tasks will find strong cost efficiency here, but should pair it with a more capable model for anything requiring nuanced judgement.

Assessed June 30, 2026

Editorial notes

Qwen3.5 4B (Reasoning) is a compact Alibaba model with surprisingly strong agentic scores for its size, ultra-low pricing at $0.03/1M input, but limited reasoning depth keeps it suited to lightweight automation tasks.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.3Technical4.4Value7.3Content6.2
Intelligence 3.3/10
Technical 4.4/10
Content 6.2/10
Value 7.3/10

How Qwen3.5 4B (Reasoning) compares

Qwen3.5 4B (Reasoning) ranks #135 of 385 AI models we track for overall intelligence, #78 of 139 for coding, #77 of 293 for agentic tasks. At $0.03 per million input tokens it is cheaper than 76% of comparable models.

Performance Indices

Source: Artificial Analysis

20.1 Intelligence Index
22.6 Coding Index
55.1 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 77.1% Graduate-level scientific reasoning
HLE 7.8% Humanity's Last Exam
SciCode 16.1% Scientific computing

Technical

TerminalBench Hard 18.2% Agentic terminal tasks
τ²-Bench 92.1% Conversational agent benchmark

Content

IFBench 52% Instruction following
LCR 55.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen3.5 4B (Reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

ProviderAlibaba
Release Date March 2, 2026
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.03 $0.000030
Output $0.15 $0.000150

Leaderboard Categories

Frequently asked questions about Qwen3.5 4B (Reasoning)

How much does Qwen3.5 4B (Reasoning) cost?

Qwen3.5 4B (Reasoning) costs $0.03 per million input tokens and $0.15 per million output tokens.

Is Qwen3.5 4B (Reasoning) good for coding?

On our coding benchmark index, Qwen3.5 4B (Reasoning) ranks #78 of 139 models, placing it in the broader range of the field for code generation and debugging.

Who created Qwen3.5 4B (Reasoning)?

Qwen3.5 4B (Reasoning) is developed by Alibaba and was released on March 2, 2026.