Qwen: Qwen3.5-9B

Qwen: Qwen3.5-9B

qwen · Released Mar 10, 2026 Specialist
Intelligence #101 / 544
57.8 Our Score
Speed #165 / 257
67.2 tokens / sec
Input #139 / 551
$0.040 per 1M tokens
Output #151 / 551
$0.150 per 1M tokens
Context #92 / 551
262,144 tokens

Analysis Summary

Qwen: Qwen3.5-9B sits in the Specialist tier on our leaderboard, ranked #101 of 544 published models on overall intelligence. At $0.040 input and $0.150 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Editorial notes

Qwen3.5-9B is an ultra-cheap multimodal model at $0.10/$0.15 per million tokens with vision and video support, though reasoning and coding benchmarks are limited, best suited to cost-sensitive SEO and content tasks.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.9Technical4.8Value8Content4.5
Intelligence 4.9/10
Technical 4.8/10
Content 4.5/10
Value 8/10

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design..

9B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

32.4 Intelligence Index
25.3 Coding Index
55.5 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 80.6% Graduate-level scientific reasoning
HLE 13.3% Humanity's Last Exam
SciCode 27.5% Scientific computing

Technical

TerminalBench Hard 24.2% Agentic terminal tasks
τ²-Bench 86.8% Conversational agent benchmark

Content

IFBench 66.7% Instruction following
LCR 59% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3.5-9B stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3.5-9b
Providerqwen
Release Date March 10, 2026
Context Length262,144 tokens
Max Completion81,920 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.15 $0.000150

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.8%
Avg Uptime
176ms
Best Latency (TTFT)
50 tok/s
Best Throughput
3/3
Active Endpoints
Available via: DeepInfra, Together, Venice

Leaderboard Categories

SEO