DeepSeek: DeepSeek V3.2

DeepSeek: DeepSeek V3.2

deepseek · Released Dec 1, 2025 Specialist
66.3
Our Score

Performance Profile

Intelligence5.9Technical6.7Value7.8Content6
Intelligence 5.9/10
Technical 6.7/10
Content 6/10
Value 7.8/10

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism that reduces training and inference cost while preserving quality in long-context scenarios. A scalable reinforcement learning post-training framework further improves reasoning, with reported performance in the GPT-5 class, and the model has demonstrated gold-medal results on the 2025 IMO and IOI. V3.2 also uses a large-scale agentic task synthesis pipeline to better integrate reasoning into tool-use settings, boosting compliance and generalization in interactive environments. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

$0.26 / 1M
Input Price
$0.38 / 1M
Output Price
163,840 tokens
Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerDeepSeek

Performance Indices

Source: Artificial Analysis

32.1 Intelligence Index
34.6 Coding Index
55.8 Agentic Index
59 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 75.1% Graduate-level scientific reasoning
HLE 10.5% Humanity's Last Exam
MMLU Pro 83.7% Multi-task language understanding
AIME 2025 59% Competition mathematics (2025)
SciCode 38.7% Scientific computing

Technical

LiveCodeBench 59.3% Live coding evaluation
TerminalBench Hard 32.6% Agentic terminal tasks
τ²-Bench 78.9% Conversational agent benchmark

Content

IFBench 49% Instruction following
LCR 39% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID deepseek/deepseek-v3.2
Providerdeepseek
Model FamilyDeepSeek
Release Date December 1, 2025
Context Length163,840 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.26 $0.000260
Output $0.38 $0.000380

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.5%
Avg Uptime
251ms
Best Latency (TTFT)
85 tok/s
Best Throughput
13/13
Active Endpoints
Available via: DeepInfra, AtlasCloud, Novita, SiliconFlow, DeepSeek, AkashML, Chutes, Parasail +4 more

Leaderboard Categories