DeepSeek: DeepSeek V3.1 Terminus (exacto)

DeepSeek: DeepSeek V3.1 Terminus (exacto)

deepseek · Released Sep 22, 2025
30
Our Score

DeepSeek-V3.1 Terminus is an update to DeepSeek V3.1 that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's performance in coding and search agents. It is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes. It extends the DeepSeek-V3 base with a two-phase long-context training process, reaching up to 128K tokens, and uses FP8 microscaling for efficient inference. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs The model improves tool use, code generation, and reasoning efficiency, achieving performance comparable to DeepSeek-R1 on difficult benchmarks while responding more quickly. It supports structured tool calling, code agents, and search agents, making it suitable for research, coding, and agentic workflows.

$0.21 / 1M Input Price
$0.79 / 1M Output Price
163,840 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerDeepSeek
Instruct Typedeepseek-v3.1

Model Information

OpenRouter ID deepseek/deepseek-v3.1-terminus:exacto
Providerdeepseek
Model FamilyDeepSeek
Release Date September 22, 2025
Context Length163,840 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.21 $0.000210
Output $0.79 $0.000790

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

97.1%
Avg Uptime
991ms
Best Latency (TTFT)
42.5 tok/s
Best Throughput
5/6
Active Endpoints
Available via: DeepInfra, Chutes, Novita, SiliconFlow, AtlasCloud, SambaNova