DeepSeek: DeepSeek V3.2

DeepSeek: DeepSeek V3.2

deepseek · Released Dec 1, 2025
65
Our Score

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

$0.26 / 1M Input Price
$0.38 / 1M Output Price
163,840 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerDeepSeek

Performance Indices

Source: Artificial Analysis

41.7 Intelligence Index
36.7 Coding Index
63.1 Agentic Index
92 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 84%
Graduate-level scientific reasoning
HLE 22.2%
Humanity's Last Exam
MMLU Pro 86.2%
Multi-task language understanding
LiveCodeBench 86.2%
Live coding evaluation
SciCode 38.9%
Scientific computing
AIME 2025 92%
Competition mathematics (2025)
IFBench 60.7%
Instruction following
LCR 65%
Long-context reasoning
TerminalBench Hard 35.6%
Agentic terminal tasks
τ²-Bench 90.6%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID deepseek/deepseek-v3.2
Providerdeepseek
Model FamilyDeepSeek
Release Date December 1, 2025
Context Length163,840 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.26 $0.000260
Output $0.38 $0.000380

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99%
Avg Uptime
1,082ms
Best Latency (TTFT)
26 tok/s
Best Throughput
9/9
Active Endpoints
Available via: DeepInfra, AtlasCloud, Novita, Ionstream, SiliconFlow, DeepSeek, Parasail, Google

Leaderboard Categories