ModelBeat
Models/Claude Opus 4.5
All models
A

Anthropic: Claude Opus 4.5

anthropic/claude-opus-4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use.

Context
200K
Input
$5 / 1M
Output
$25 / 1M
Released
Nov 24, 2025
Modalities
fileimagetexttext

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where Claude Opus 4.5 lands among all models tracked on Model Beat.

58.0
Intelligence Index
Epoch AI
58th percentile of tracked models
72.0
Coding Index
Epoch AI
72nd percentile of tracked models
68.0
Agentic Index
Epoch AI
68th percentile of tracked models

Reasoning

7 evals
ARC-AGI80.0%

Abstract visual reasoning

ARC-AGI-237.6%

Harder abstract reasoning

GPQA Diamond86.0%

Graduate-level scientific reasoning

Humanity's Last Exam25.2%

Frontier of human expert knowledge

SimpleBench62.0%

Common-sense trick questions

SimpleQA Verified41.8%

Factual accuracy & hallucination

WeirdML63.7%

Novel ML problem-solving

Coding

3 evals
GSO (code optimization)26.5%

Code performance optimization

SWE-bench Verified76.7%

Real GitHub issue resolution

WebDev Arena1512

Human-rated web development

Math

3 evals
AIME 2024/202586.1%

Olympiad-qualifier math

FrontierMath20.7%

Research-level math problems

FrontierMath Tier 44.2%

Hardest research math

Agentic & Tools

4 evals
APEX18.4%

Multi-step agentic tasks

GDPval (win/tie rate)59.6%

Economically valuable work

METR task horizon4.9 h

Autonomous task length

Terminal-Bench63.1%

Command-line agentic tasks

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.