Home > AI Models > DeepSeek: R1 Distill Qwen 32B

DeepSeek: R1 Distill Qwen 32B

Name: DeepSeek: R1 Distill Qwen 32B Review
Item: DeepSeek: R1 Distill Qwen 32B
Rating: 3.5
Author: Design for Online

DeepSeek: R1 Distill Qwen 32B

deepseek · Released Jan 29, 2025 Efficient

Intelligence #238 / 525

34.8 Our Score

Speed #207 / 244

43.6 tokens / sec

Input #295 / 525

$0.290 per 1M tokens

Output #187 / 525

$0.290 per 1M tokens

Context #336 / 525

32,768 tokens

DeepSeek: R1 Distill Qwen 32B sits in the Efficient tier on our leaderboard, ranked #238 of 525 published models on overall intelligence. At $0.290 input and $0.290 output per 1M tokens, it is among the most expensive on the market. It offers a mid-sized context window and supports reasoning.

Editorial notes

DeepSeek's R1 Distill Qwen 32B shows strong mathematical reasoning but limited overall intelligence and coding performance, with a restricted 32K context window. It offers reasonable pricing but limited accessibility for Western businesses reduces its practical appeal.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 32,768 tokens
Max output: 32,768 tokens
Tokenizer: Qwen
Released: Jan 29, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. It outperforms OpenAI's o1-mini across various benchmarks, achieving new..

32B Parameters

Architecture Detail

Instruct Type deepseek-r1

Performance Indices

Source: Artificial Analysis

17.2 Intelligence Index

63 Math Index

Benchmark Scores

GPQA Diamond 61.5% Graduate-level scientific reasoning

HLE 5.5% Humanity's Last Exam

MMLU Pro 73.9% Multi-task language understanding

MATH 500 94.1% Mathematical problem-solving

AIME 68.7% Competition mathematics

AIME 2025 63% Competition mathematics (2025)

SciCode 37.6% Scientific computing

LiveCodeBench 27% Live coding evaluation

IFBench 22.9% Instruction following

LCR 9.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does DeepSeek: R1 Distill Qwen 32B stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID	`deepseek/deepseek-r1-distill-qwen-32b`
Provider	deepseek
Model Family	DeepSeek
Release Date	January 29, 2025
Context Length	32,768 tokens
Max Completion	32,768 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.29	$0.000290
Output	$0.29	$0.000290

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%

Avg Uptime

737ms

Best Latency (TTFT)

26 tok/s

Best Throughput

1/1

Active Endpoints

Available via: NextBit

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

DeepSeek: R1 Distill Qwen 32B

DeepSeek: R1 Distill Qwen 32B

Analysis Summary

Performance Profile

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

External Resources

DeepSeek: R1 Distill Qwen 32B

Performance Profile

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

External Resources

Explore Related Models