AI Model Comparison

Our Story

Claude Sonnet 4 is Anthropic's balanced mid-tier model released alongside Opus 4 in May 2025, designed to combine strong coding and reasoning capabilities with computational efficiency. It achieves state-of-the-art 72.7% on SWE-bench while offering significantly lower cost and faster response times than Opus models. Key strengths include autonomous codebase navigation, reduced error rates in agent-driven workflows, and high reliability in following intricate instructions, making it a versatile choice for both routine and complex development tasks.

Author

Anthropic

Release Date

2025-05-22

Knowledge Cutoff

2025-01-31

License

Proprietary

I/O Format

Context Length

1M / 64K

API I/O (1M)

$3 / $15

How to Use

API Access

Output Speed

45 tok/s

Arena Overall

1399

Intelligence Index

38.7

Coding Index

34.1

Math Index

74.3

LiveBench

60.6

ForecastBench

58.7

GPQA Diamond

77.7%

HLE

9.6%

MMLU-Pro

84.2%

AIME 2025

74.3%

MATH-500

99.1%

LB Reasoning

69.0

LB Math

70.5

LB Data Analysis

54.6

LiveCodeBench

65.5%

LB Coding

77.5

LB Agentic

40.0

TAU2

64.6%

TerminalBench

31.1%

SciCode

40.0%

IFBench

54.7%

AA-LCR

0.6

Hallucination (HHEM)

10.3%

Factual Consistency (HHEM)

89.7%

LB Language

72.9

LB Instruction Following

44.3

Calculate Cost View Model Details

1 / 3

Swipe to compare