
Claude Sonnet 4 is Anthropic's mid-tier model, released alongside Opus 4 in May 2025 and designed to combine strong coding and reasoning capabilities with computational efficiency. It scores 72.7% on SWE-bench Verified, a state-of-the-art result at release, while costing less and responding faster than the Opus models. Key strengths include autonomous codebase navigation, lower error rates in agent-driven workflows, and reliable adherence to intricate instructions, making it suitable for both routine and complex development tasks.

Author
Anthropic
Release Date
2025-05-22
Knowledge Cutoff
2025-01-31
License
Proprietary
I/O Format
Context Length
1M context / 64K max output (tokens)
API I/O (1M)
$3 input / $15 output (USD per 1M tokens)
How to Use
API Access
Output Speed
45 tok/s
Arena Overall
1399
Intelligence Index
38.7
Coding Index
34.1
Math Index
74.3
LiveBench
60.6
ForecastBench
58.7
GPQA Diamond
77.7%
HLE
9.6%
MMLU-Pro
84.2%
AIME 2025
74.3%
MATH-500
99.1%
LB Reasoning
69.0
LB Math
70.5
LB Data Analysis
54.6
LiveCodeBench
65.5%
LB Coding
77.5
LB Agentic
40.0
TAU2
64.6%
TerminalBench
31.1%
SciCode
40.0%
IFBench
54.7%
AA-LCR
0.6
Hallucination (HHEM)
10.3%
Factual Consistency (HHEM)
89.7%
LB Language
72.9
LB Instruction Following
44.3
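
The pricing and output speed listed above can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator: it assumes the listed prices are flat USD rates per 1M tokens (input and output billed separately) and that the 45 tok/s output speed is a steady average. The function names `estimate_cost_usd` and `estimate_generation_seconds` are hypothetical helpers introduced for illustration; actual billing may differ, for example for very long prompts, caching, or batch usage.

```python
# Figures taken from the card above; units are assumptions (USD per 1M tokens).
INPUT_USD_PER_MTOK = 3.0
OUTPUT_USD_PER_MTOK = 15.0
OUTPUT_TOKENS_PER_SECOND = 45.0  # listed output speed, treated as an average


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request under flat per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimate how long streaming the output takes at the listed speed."""
    if output_tokens < 0:
        raise ValueError("token count must be non-negative")
    return output_tokens / OUTPUT_TOKENS_PER_SECOND


if __name__ == "__main__":
    # Example: a 20,000-token prompt that produces a 2,000-token answer.
    cost = estimate_cost_usd(20_000, 2_000)
    seconds = estimate_generation_seconds(2_000)
    print(f"estimated cost: ${cost:.2f}")
    print(f"estimated generation time: {seconds:.0f} s")
```

Under these assumptions, output tokens cost five times as much as input tokens, so response length tends to dominate the bill for short prompts, while prompt length dominates for long-context requests.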