DeepSeek: DeepSeek V3.1 Terminus

DeepSeek: DeepSeek V3.1 Terminus

deepseek · Released Sep 22, 2025 Specialist
Intelligence #89 / 556
60.5 Our Score
AA Index #108 / 365
28.5 Artificial Analysis
Input #311 / 557
$0.270 per 1M tokens
Output #306 / 557
$0.950 per 1M tokens
Context #206 / 557
163,840 tokens

Analysis Summary

DeepSeek: DeepSeek V3.1 Terminus sits in the Specialist tier on our leaderboard, ranked #89 of 556 published models on overall intelligence. At $0.270 input and $0.950 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use and function calling.

Editorial notes

DeepSeek V3.1 Terminus posts the strongest terminal benchmark in this batch and solid coding scores, with tool use and function calling at competitive pricing; note the -4 regional penalty applies.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.4Technical5.5Value7.8Content5.5
Intelligence 5.4/10
Technical 5.5/10
Content 5.5/10
Value 7.8/10

DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's..

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typedeepseek-v3.1

Performance Indices

Source: Artificial Analysis

28.5 Intelligence Index
31.9 Coding Index
34.5 Agentic Index
53.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 75.1% Graduate-level scientific reasoning
HLE 8.4% Humanity's Last Exam
MMLU Pro 83.6% Multi-task language understanding
AIME 2025 53.7% Competition mathematics (2025)
SciCode 32.1% Scientific computing

Technical

LiveCodeBench 52.9% Live coding evaluation
TerminalBench Hard 31.8% Agentic terminal tasks
τ²-Bench 37.1% Conversational agent benchmark

Content

IFBench 41.2% Instruction following
LCR 43.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does DeepSeek: DeepSeek V3.1 Terminus stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID deepseek/deepseek-v3.1-terminus
Providerdeepseek
Model FamilyDeepSeek
Release Date September 22, 2025
Context Length163,840 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.27 $0.000270
Output $0.95 $0.000950

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9%
Avg Uptime
685ms
Best Latency (TTFT)
27 tok/s
Best Throughput
4/4
Active Endpoints
Available via: DeepInfra, SiliconFlow, Novita, AtlasCloud

Leaderboard Categories