Anthropic

Claude Opus 4.6

Name: Anthropic Claude Opus 4.6
Author: Anthropic

Compare

Model ID:claude-opus-4-6

2026-02-04

Compare

Claude Opus 4.6 is Anthropic's most intelligent model released in February 2026, built for agents that operate across entire workflows rather than single prompts. It features a 1M-token context window, 128K max output tokens, and the ability to spawn and coordinate multiple sub-agents working in parallel — a capability called Agent Teams. With adaptive thinking that dynamically adjusts reasoning depth, the model excels at large codebases, complex refactors, sustained knowledge work, and end-to-end project execution, producing near-production-ready documents and analyses in a single pass.

Anthropic ProAnthropic Max (5x)Anthropic Max (20x)API|VisionReasoningWeb Search|Proprietary Model

Knowledge Cutoff

2025-09-01

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

The types of content this AI can receive, and what it can produce in return.

Context Memory

1MIN128KOUT

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Words

$5IN$25OUT

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).

Calculate Cost

Source:Official Docs OpenRouter

AI Performance Evaluation

Arena Overall Score

1503

±5

As of 2026-04-23

Overall Rank

🥈 No.2

20,192 Votes

Arena by Ability

Hard Prompts

1535±6🥇 No.1

Expert Knowledge

1544±16🥈 No.2

Instruction Following

1516±9🥇 No.1

Conversation Memory

1514±11🥉 No.3

Creative

1494±11🥈 No.2

Coding

1554±9🥉 No.3

Math

1517±17🥇 No.1

Arena by Occupation

Creative Writing

1494±9🥈 No.2

Social Sciences

1517±11🥈 No.2

Media

1485±10🥇 No.1

Business

1499±10🥈 No.2

Healthcare

1512±16No.7

Legal

1514±16🥇 No.1

Software

1541±8🥈 No.2

Mathematics

1520±20🥈 No.2

Source:Arena Intelligence

Overall

AA Intelligence Index

53%↑15%

LiveBench

77%↑17%

ForecastBench

59%↑0%

Reasoning & Math

GPQA Diamond

90%↑9%

HLE

37%↑20%

LB Reasoning

89%↑29%

LB Math

89%↑16%

LB Data

70%↑20%

Coding

AA Coding Index

48%↑14%

LB Coding

78%↑5%

LB Agentic

62%↑18%

TAU2

92%↑19%

TerminalBench

46%↑15%

SciCode

52%↑11%

Language & Instructions

IFBench

53%↓4%

AA-LCR

71%↑9%

Hallucination (HHEM)

12%↑2%

Factual (HHEM)

88%↓2%

LB Language

83%↑11%

LB IF

63%↑17%

Output Speed

Standard Mode

45tok/s↓37

First Output 1.75s

Reasoning Mode

58tok/s↓30

First Output 12.62s

Source:Artificial Analysis LiveBench ForecastBench Vectara HHEM

Anthropic