DeepSeek: DeepSeek V3.1 Terminus

DeepSeek: DeepSeek V3.1 Terminus

deepseek · Released Sep 22, 2025 Specialist
Intelligence #93 / 523
58.8 Our Score
AA Index #72 / 353
33.9 Artificial Analysis
Input #270 / 525
$0.210 per 1M tokens
Output #274 / 525
$0.790 per 1M tokens
Context #174 / 525
163,840 tokens

Analysis Summary

DeepSeek: DeepSeek V3.1 Terminus sits in the Specialist tier on our leaderboard, ranked #93 of 523 published models on overall intelligence. At $0.210 input and $0.790 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use and function calling.

Editorial notes

DeepSeek V3.1 Terminus offers solid coding performance and reasonable reasoning at a competitive price point, with tool use and function calling support. It's a capable mid-tier option for coding-focused tasks, though limited Western provider support is a practical consideration for UK businesses.

Assessed April 24, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.4Technical5.5Value7.8Content4.5
Intelligence 5.4/10
Technical 5.5/10
Content 4.5/10
Value 7.8/10

DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's..

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typedeepseek-v3.1

Performance Indices

Source: Artificial Analysis

33.9 Intelligence Index
33.7 Coding Index
33.7 Agentic Index
89.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 79.2% Graduate-level scientific reasoning
HLE 15.2% Humanity's Last Exam
MMLU Pro 85.1% Multi-task language understanding
AIME 2025 89.7% Competition mathematics (2025)
SciCode 40.6% Scientific computing

Technical

LiveCodeBench 79.8% Live coding evaluation
TerminalBench Hard 30.3% Agentic terminal tasks
τ²-Bench 37.1% Conversational agent benchmark

Content

IFBench 57% Instruction following
LCR 65% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does DeepSeek: DeepSeek V3.1 Terminus stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID deepseek/deepseek-v3.1-terminus
Providerdeepseek
Model FamilyDeepSeek
Release Date September 22, 2025
Context Length163,840 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.21 $0.000210
Output $0.79 $0.000790

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.8%
Avg Uptime
674ms
Best Latency (TTFT)
29 tok/s
Best Throughput
4/4
Active Endpoints
Available via: DeepInfra, Novita, SiliconFlow, AtlasCloud

Leaderboard Categories