Qwen: Qwen3.5-35B-A3B

Qwen: Qwen3.5-35B-A3B

qwen · Released Feb 25, 2026 New
70
Our Score

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall performance is comparable to that of the Qwen3.5-27B.

$0.16 / 1M Input Price
$1.30 / 1M Output Price
262,144 tokens Context Window
65,536 tokens Max Output
35B Parameters

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + Video → Text
TokenizerQwen3
Parameters35B

Performance Indices

Source: Artificial Analysis

37.1 Intelligence Index
30.3 Coding Index
57.9 Agentic Index

Benchmark Scores

Evaluations

GPQA Diamond 84.5%
Graduate-level scientific reasoning
HLE 19.7%
Humanity's Last Exam
SciCode 37.7%
Scientific computing
IFBench 72.5%
Instruction following
LCR 62.7%
Long-context reasoning
TerminalBench Hard 26.5%
Agentic terminal tasks
τ²-Bench 89.2%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3.5-35b-a3b
Providerqwen
Release Date February 25, 2026
Context Length262,144 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.16 $0.000163
Output $1.30 $0.001300

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

94.2%
Avg Uptime
565ms
Best Latency (TTFT)
108 tok/s
Best Throughput
5/5
Active Endpoints
Available via: Alibaba, AtlasCloud, Ionstream, Parasail, Venice

Leaderboard Categories