DeepSeek: DeepSeek V4 Flash

DeepSeek: DeepSeek V4 Flash

deepseek · Released Apr 24, 2026 Professional New
Intelligence #18 / 556
82.2 Our Score
Speed #113 / 257
102.1 tokens / sec
Input #222 / 557
$0.112 per 1M tokens
Output #186 / 557
$0.224 per 1M tokens
Context #15 / 557
1M tokens

Analysis Summary

DeepSeek: DeepSeek V4 Flash sits in the Professional tier on our leaderboard, ranked #18 of 556 published models on overall intelligence. At $0.112 input and $0.224 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use and function calling.

Editorial notes

DeepSeek V4 Flash pairs excellent reasoning and agentic capability with a 1M token context and very aggressive pricing, making it a strong value option for business workflows.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence8.3Technical7.5Value8.3Content7.5
Intelligence 8.3/10
Technical 7.5/10
Content 7.5/10
Value 8.3/10

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

46.5 Intelligence Index
38.7 Coding Index
65.3 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 89.4% Graduate-level scientific reasoning
HLE 32.1% Humanity's Last Exam
SciCode 44.9% Scientific computing

Technical

TerminalBench Hard 35.6% Agentic terminal tasks
τ²-Bench 95% Conversational agent benchmark

Content

IFBench 79.2% Instruction following
LCR 63% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does DeepSeek: DeepSeek V4 Flash stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID deepseek/deepseek-v4-flash
Providerdeepseek
Model FamilyDeepSeek
Release Date April 24, 2026
Context Length1,048,576 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.11 $0.000112
Output $0.22 $0.000224

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

91.2%
Avg Uptime
483ms
Best Latency (TTFT)
106 tok/s
Best Throughput
11/12
Active Endpoints
Available via: GMICloud, Baidu, DeepSeek, SiliconFlow, Parasail, AtlasCloud, DeepInfra, AkashML +4 more

Leaderboard Categories