Qwen: Qwen3 32B

Qwen: Qwen3 32B

qwen · Released Apr 28, 2025 Efficient
Intelligence #168 / 525
44.3 Our Score
Speed #116 / 244
100.2 tokens / sec
Input #162 / 525
$0.080 per 1M tokens
Output #174 / 525
$0.240 per 1M tokens
Context #330 / 525
40,960 tokens

Analysis Summary

Qwen: Qwen3 32B sits in the Efficient tier on our leaderboard, ranked #168 of 525 published models on overall intelligence. At $0.080 input and $0.240 output per 1M tokens, it is among the most expensive on the market. It offers a mid-sized context window and supports tool use, function calling, and reasoning.

Editorial notes

Qwen3 32B delivers competitive coding and math performance at a low price with tool use support, making it one of the better value mid-tier options — though regional accessibility may be a consideration for UK businesses.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.7Technical2.8Value7.5Content4
Intelligence 3.7/10
Technical 2.8/10
Content 4/10
Value 7.5/10

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for..

32B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typeqwen3

Performance Indices

Source: Artificial Analysis

16.5 Intelligence Index
13.8 Coding Index
16.4 Agentic Index
73 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 66.8% Graduate-level scientific reasoning
HLE 8.3% Humanity's Last Exam
MMLU Pro 79.8% Multi-task language understanding
MATH 500 96.1% Mathematical problem-solving
AIME 80.7% Competition mathematics
AIME 2025 73% Competition mathematics (2025)
SciCode 35.4% Scientific computing

Technical

LiveCodeBench 54.6% Live coding evaluation
TerminalBench Hard 3% Agentic terminal tasks
τ²-Bench 29.8% Conversational agent benchmark

Content

IFBench 36.3% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 32B stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-32b
Providerqwen
Release Date April 28, 2025
Context Length40,960 tokens
Max Completion40,960 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.24 $0.000240

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.2%
Avg Uptime
229ms
Best Latency (TTFT)
304 tok/s
Best Throughput
6/8
Active Endpoints
Available via: Chutes, DeepInfra, Nebius, Novita, AtlasCloud, Alibaba, SiliconFlow, Groq

Leaderboard Categories