Qwen: Qwen3 32B

Qwen: Qwen3 32B

qwen · Released Apr 28, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #127 / 279
101.5 tokens / sec
Input #185 / 592
$0.080 per 1M tokens
Output #208 / 592
$0.280 per 1M tokens
Context #245 / 592
131,072 tokens

Analysis Summary

Qwen3 32B is the largest dense model in the Qwen3 open-weight family, offering the strongest benchmark performance among the sub-235B Qwen3 variants. Its coding index of 15.3 and livecodebench score of 0.546 are competitive for its price tier, and a GPQA of 0.668 indicates meaningful scientific reasoning capability. Tool use and function calling are both supported.

For businesses, the 32B is well suited to code generation, technical documentation, and structured content workflows where a small model falls short but a frontier model is cost-prohibitive. Its math index of 73 and AIME score of 0.73 show strong quantitative reasoning. Agentic performance is limited, so complex multi-step tool orchestration is not its strength.

At $0.08 input and $0.28 output per million tokens, it is priced attractively for its capability level. Teams running moderate-complexity coding or analysis tasks at volume will find it a cost-effective workhorse, particularly if they are already in the Qwen ecosystem.

Assessed June 30, 2026

Editorial notes

Qwen3 32B delivers strong coding benchmarks and solid instruction following at a low price, with tool use and function calling, making it a capable mid-tier option for technical workflows.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.8Technical2.7Value7.8Content4
Intelligence 2.8/10
Technical 2.7/10
Content 4/10
Value 7.8/10

How Qwen: Qwen3 32B compares

Qwen: Qwen3 32B ranks #211 of 385 AI models we track for overall intelligence, #103 of 139 for coding, #210 of 293 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.08 per million input tokens it is cheaper than 69% of comparable models.

About Qwen: Qwen3 32B

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for..

32B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typeqwen3

Performance Indices

Source: Artificial Analysis

11.5 Intelligence Index
15.3 Coding Index
16.4 Agentic Index
73 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 66.8% Graduate-level scientific reasoning
HLE 8.3% Humanity's Last Exam
MMLU Pro 79.8% Multi-task language understanding
MATH 500 96.1% Mathematical problem-solving
AIME 80.7% Competition mathematics
AIME 2025 73% Competition mathematics (2025)
SciCode 35.4% Scientific computing

Technical

LiveCodeBench 54.6% Live coding evaluation
TerminalBench Hard 3% Agentic terminal tasks
τ²-Bench 29.8% Conversational agent benchmark

Content

IFBench 36.3% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 32B stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-32b
Providerqwen
Release Date April 28, 2025
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.28 $0.000280

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.5%
Avg Uptime
300ms
Best Latency (TTFT)
437 tok/s
Best Throughput
5/5
Active Endpoints
Available via: DeepInfra, Nebius, Alibaba, SiliconFlow, Groq

Leaderboard Categories

Frequently asked questions about Qwen: Qwen3 32B

How much does Qwen: Qwen3 32B cost?

Qwen: Qwen3 32B costs $0.08 per million input tokens and $0.28 per million output tokens.

What is the context window of Qwen: Qwen3 32B?

Qwen: Qwen3 32B has a context window of 131,072 tokens (131K).

Is Qwen: Qwen3 32B good for coding?

On our coding benchmark index, Qwen: Qwen3 32B ranks #103 of 139 models, placing it in the broader range of the field for code generation and debugging.

What can Qwen: Qwen3 32B do?

Qwen: Qwen3 32B supports tool use and function calling.

Who created Qwen: Qwen3 32B?

Qwen: Qwen3 32B is developed by Qwen and was released on April 28, 2025.