Qwen3.5 2B (Reasoning)

Alibaba · Released Mar 2, 2026 · Professional
Intelligence #14 / 590
82.0 Our Score
Speed #278 / 279
21.7 tokens / sec
Input #135 / 592
$0.020 per 1M tokens
Output #142 / 592
$0.100 per 1M tokens
Context
— Not reported

Analysis Summary

Qwen3.5 2B (Reasoning) sits in the lower-mid range of the Alibaba small-model lineup, with an intelligence index of 10.2 and a coding index of 19.7. The agentic index of 36.4 is below average even for small models, and both instruction following (IFBench 31.5%) and long-context reasoning (LCR 23.7%) are limited. A GPQA Diamond score of 45.6% shows some scientific reasoning capability relative to its size.

For businesses, this model is appropriate for simple structured extraction, short-form summarisation, and lightweight classification tasks. It is not suited to code generation, complex reasoning chains, or any workflow requiring reliable multi-step output.

At $0.02/1M input, it offers strong economy for its tier. Teams should use it as a cost-efficient component for simple, well-defined tasks within a larger pipeline, not as a standalone reasoning engine.

Assessed June 30, 2026

Editorial notes

Qwen3.5 2B (Reasoning) is a small Alibaba model with an intelligence index of 10.2 and very low pricing, suited to simple structured tasks but not competitive for reasoning or coding workloads.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 1.7/10
Technical 3/10
Content 3.1/10
Value 7.3/10

How Qwen3.5 2B (Reasoning) compares

Qwen3.5 2B (Reasoning) ranks #226 of 385 AI models we track for overall intelligence, #126 of 293 for agentic tasks. At $0.02 per million input tokens it is cheaper than 77% of comparable models.

Performance Indices

Source: Artificial Analysis

10.2 Intelligence Index
36.4 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond: 45.6% (graduate-level scientific reasoning)
HLE: 2.1% (Humanity's Last Exam)
SciCode: 2.8% (scientific computing)

Technical

TerminalBench Hard: 3.8% (agentic terminal tasks)
τ²-Bench: 69% (conversational agent benchmark)

Content

IFBench: 31.5% (instruction following)
LCR: 23.7% (long-context reasoning)

Benchmark data from Artificial Analysis and Hugging Face

Model Information

Provider Alibaba
Release Date March 2, 2026
Status Active

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $0.02                 $0.000020
Output        $0.10                 $0.000100
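
To make the listed prices concrete, here is a minimal sketch of a per-request cost and generation-time estimate. The prices and the 21.7 tokens/sec throughput are taken from this page; the function names and the example workload (10,000 requests of 1,500 input and 200 output tokens each) are hypothetical, chosen only for illustration. It ignores any provider-specific billing details, so treat the result as a rough estimate.

```python
# Prices from the pricing table on this page (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.02
OUTPUT_USD_PER_MILLION = 0.10
# Output throughput from the summary card on this page (tokens per second).
OUTPUT_TOKENS_PER_SECOND = 21.7


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed prices."""
    total_micro_usd = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total_micro_usd / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimate the time to stream the output at the listed throughput."""
    return output_tokens / OUTPUT_TOKENS_PER_SECOND


# Hypothetical workload, not a figure from this page.
requests = 10_000
per_request = estimate_cost_usd(input_tokens=1_500, output_tokens=200)

print(f"per request: ${per_request:.6f}")                      # $0.000050
print(f"per {requests:,} requests: ${per_request * requests:.2f}")  # $0.50
print(f"time for 200 output tokens: {estimate_generation_seconds(200):.1f} s")  # 9.2 s
```

At these rates, cost is rarely the constraint for short tasks. The low throughput is more likely to matter, since even a 200-token answer takes roughly nine seconds to generate.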

Frequently asked questions about Qwen3.5 2B (Reasoning)

How much does Qwen3.5 2B (Reasoning) cost?

Qwen3.5 2B (Reasoning) costs $0.02 per million input tokens and $0.10 per million output tokens.

Who created Qwen3.5 2B (Reasoning)?

Qwen3.5 2B (Reasoning) is developed by Alibaba and was released on March 2, 2026.