Qwen: Qwen3 32B
Analysis Summary
Qwen3 32B is the largest dense model in the Qwen3 open-weight family, offering the strongest benchmark performance among the sub-235B Qwen3 variants. Its coding index of 15.3 and livecodebench score of 0.546 are competitive for its price tier, and a GPQA of 0.668 indicates meaningful scientific reasoning capability. Tool use and function calling are both supported.
For businesses, the 32B is well suited to code generation, technical documentation, and structured content workflows where a small model falls short but a frontier model is cost-prohibitive. Its math index of 73 and AIME score of 0.73 show strong quantitative reasoning. Agentic performance is limited, so complex multi-step tool orchestration is not its strength.
At $0.08 input and $0.28 output per million tokens, it is priced attractively for its capability level. Teams running moderate-complexity coding or analysis tasks at volume will find it a cost-effective workhorse, particularly if they are already in the Qwen ecosystem.
Assessed June 30, 2026
Editorial notes
Qwen3 32B delivers strong coding benchmarks and solid instruction following at a low price, with tool use and function calling, making it a capable mid-tier option for technical workflows.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How Qwen: Qwen3 32B compares
Qwen: Qwen3 32B ranks #211 of 385 AI models we track for overall intelligence, #103 of 139 for coding, #210 of 293 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.08 per million input tokens it is cheaper than 69% of comparable models.
About Qwen: Qwen3 32B
Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for..
Capabilities
Architecture Detail
| Instruct Type | qwen3 |
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Qwen: Qwen3 32B stack up?
Compare side-by-side with other professional models.
Model Information
| OpenRouter ID |
qwen/qwen3-32b
|
| Provider | qwen |
| Release Date | April 28, 2025 |
| Context Length | 131,072 tokens |
| Max Completion | 16,384 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.08 | $0.000080 |
| Output | $0.28 | $0.000280 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about Qwen: Qwen3 32B
How much does Qwen: Qwen3 32B cost?
Qwen: Qwen3 32B costs $0.08 per million input tokens and $0.28 per million output tokens.
What is the context window of Qwen: Qwen3 32B?
Qwen: Qwen3 32B has a context window of 131,072 tokens (131K).
Is Qwen: Qwen3 32B good for coding?
On our coding benchmark index, Qwen: Qwen3 32B ranks #103 of 139 models, placing it in the broader range of the field for code generation and debugging.
What can Qwen: Qwen3 32B do?
Qwen: Qwen3 32B supports tool use and function calling.
Who created Qwen: Qwen3 32B?
Qwen: Qwen3 32B is developed by Qwen and was released on April 28, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: July 2, 2026 8:38 pm