Models/Claude Opus 4.5

Anthropic: Claude Opus 4.5

Name: Claude Opus 4.5
Author: Anthropic

anthropic/claude-opus-4.5

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use.

Context

200K

Input

$5 / 1M

Output

$25 / 1M

Released

Nov 24, 2025

Modalities

fileimagetext→text

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where Claude Opus 4.5 lands among all models tracked on Model Beat.

58.0

Intelligence Index

Epoch AI

58th percentile of tracked models

72.0

Coding Index

Epoch AI

72nd percentile of tracked models

68.0

Agentic Index

Epoch AI

68th percentile of tracked models

Reasoning

7 evals

ARC-AGI80.0%

Abstract visual reasoning

ARC-AGI-237.6%

Harder abstract reasoning

GPQA Diamond86.0%

Graduate-level scientific reasoning

Humanity's Last Exam25.2%

Frontier of human expert knowledge

SimpleBench62.0%

Common-sense trick questions

SimpleQA Verified41.8%

Factual accuracy & hallucination

WeirdML63.7%

Novel ML problem-solving

Coding

3 evals

GSO (code optimization)26.5%

Code performance optimization

SWE-bench Verified76.7%

Real GitHub issue resolution

WebDev Arena1512

Human-rated web development

Math

3 evals

AIME 2024/202586.1%

Olympiad-qualifier math

FrontierMath20.7%

Research-level math problems

FrontierMath Tier 44.2%

Hardest research math

Agentic & Tools

4 evals

APEX18.4%

Multi-step agentic tasks

GDPval (win/tie rate)59.6%

Economically valuable work

METR task horizon4.9 h

Autonomous task length

Terminal-Bench63.1%

Command-line agentic tasks

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.