anthropic/claude-opus-4.1
Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks.
Scores on standardized evaluations. Higher is better, and the rank shows where Claude Opus 4.1 places among all models tracked on Model Beat.
Graduate-level scientific reasoning
Frontier of human expert knowledge
Common-sense trick questions
Factual accuracy & hallucination
Novel ML problem-solving
Real GitHub issue resolution
Human-rated web development
Olympiad-qualifier math
Research-level math problems
Hardest research math
Economically valuable work
Autonomous task length
Command-line agentic tasks
Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.