DeepSeek
DeepSeek

DeepSeek V4 Pro

2026-04-24

DeepSeek V4 Pro is DeepSeek's flagship open-source frontier model, released April 24, 2026, with 1.6 trillion total parameters (49B active) — the largest open-weight model to date. It introduces a hybrid attention architecture combining Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA), reducing single-token FLOPs to 27% and KV cache to 10% compared to V3.2 at 1M-token context. Trained with mixed FP4/FP8 precision and Manifold-Constrained Hyper-Connections, it supports dual Thinking and Non-Thinking modes across a 1M-token context window. At launch, it scored 80.6% on SWE-bench Verified (tying Claude Opus 4.6), 93.5% on LiveCodeBench, and a 3206 Codeforces rating, while costing roughly one-seventh the price of comparable frontier models.

Reasoning|Proprietary Model
Knowledge Cutoff
Unknown
Input → Output Format
Context Memory
1.0MIN384KOUT
Cost/1M Words
$1.74IN$3.48OUT
Calculate Cost

AI Performance Evaluation

Arena Overall Score
1463
±9
As of 2026-04-23
Overall Rank
No.20
4,163 Votes
Arena by Ability
Hard Prompts
1477±11No.31
Expert Knowledge
1480±29No.32
Instruction Following
1451±16No.26
Conversation Memory
1479±22No.18
Creative
1448±24No.16
Coding
1480±17No.50
Math
1446±32No.35
Arena by Occupation
Creative Writing
1443±19No.28
Social Sciences
1478±21No.26
Media
1432±22No.28
Business
1458±20No.26
Healthcare
1521±32🥇 No.1
Legal
1494±30No.8
Software
1479±14No.43
Mathematics
1449±32No.36
Overall
AA Intelligence Index
52%↑13%
Reasoning & Math
GPQA Diamond
89%↑8%
HLE
36%↑19%
Coding
AA Coding Index
48%↑13%
TAU2
96%↑23%
TerminalBench
46%↑15%
SciCode
50%↑9%
Language & Instructions
IFBench
77%↑20%
AA-LCR
66%↑5%
Output Speed
Standard Mode
32tok/s↓50
First Output 1.54s