Home > AI Models > Qwen: Qwen3 32B

Qwen: Qwen3 32B

Name: Qwen: Qwen3 32B Review
Item: Qwen: Qwen3 32B
Rating: 4.4
Author: Design for Online

Qwen: Qwen3 32B

qwen · Released Apr 28, 2025 Efficient

Intelligence #178 / 556

44.3 Our Score

Speed #110 / 257

105.0 tokens / sec

Input #176 / 557

$0.080 per 1M tokens

Output #196 / 557

$0.280 per 1M tokens

Context #220 / 557

131,072 tokens

Qwen: Qwen3 32B sits in the Efficient tier on our leaderboard, ranked #178 of 556 published models on overall intelligence. At $0.080 input and $0.280 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

Qwen3 32B delivers strong livecodebench and math scores with tool use and function calling at $0.08/$0.28 per million tokens, offering good coding value despite modest overall intelligence benchmarks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 131,072 tokens
Max output: 16,384 tokens
Tokenizer: Qwen3
Released: Apr 28, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for..

32B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Type qwen3

Performance Indices

Source: Artificial Analysis

16.5 Intelligence Index

13.8 Coding Index

16.4 Agentic Index

73 Math Index

Benchmark Scores

GPQA Diamond 66.8% Graduate-level scientific reasoning

HLE 8.3% Humanity's Last Exam

MMLU Pro 79.8% Multi-task language understanding

MATH 500 96.1% Mathematical problem-solving

AIME 80.7% Competition mathematics

AIME 2025 73% Competition mathematics (2025)

SciCode 35.4% Scientific computing

LiveCodeBench 54.6% Live coding evaluation

TerminalBench Hard 3% Agentic terminal tasks

τ²-Bench 29.8% Conversational agent benchmark

IFBench 36.3% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 32B stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen3-32b`
Provider	qwen
Release Date	April 28, 2025
Context Length	131,072 tokens
Max Completion	16,384 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.08	$0.000080
Output	$0.28	$0.000280

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

97.7%

Avg Uptime

327ms

Best Latency (TTFT)

502 tok/s

Best Throughput

7/7

Active Endpoints

Available via: DeepInfra, Nebius, Novita, AtlasCloud, Alibaba, SiliconFlow, Groq

Leaderboard Categories

Coding Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Qwen: Qwen3 32B

Qwen: Qwen3 32B

Analysis Summary

Performance Profile

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Qwen: Qwen3 32B

Performance Profile

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models