ModelBeat
Models/GPT-5.1
All models
O

OpenAI: GPT-5.1

openai/gpt-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5.

Context
400K
Input
$1.25 / 1M
Output
$10 / 1M
Released
Nov 13, 2025
Modalities
imagetextfiletext

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where GPT-5.1 lands among all models tracked on Model Beat.

48.0
Intelligence Index
Epoch AI
48th percentile of tracked models
35.0
Coding Index
Epoch AI
35th percentile of tracked models
50.0
Agentic Index
Epoch AI
50th percentile of tracked models

Reasoning

7 evals
ARC-AGI72.8%

Abstract visual reasoning

ARC-AGI-217.6%

Harder abstract reasoning

GPQA Diamond87.6%

Graduate-level scientific reasoning

Humanity's Last Exam23.7%

Frontier of human expert knowledge

SimpleBench53.2%

Common-sense trick questions

SimpleQA Verified48.9%

Factual accuracy & hallucination

WeirdML60.8%

Novel ML problem-solving

Coding

3 evals
GSO (code optimization)13.7%

Code performance optimization

SWE-bench Verified68.0%

Real GitHub issue resolution

WebDev Arena1387

Human-rated web development

Math

3 evals
AIME 2024/202588.6%

Olympiad-qualifier math

FrontierMath31.0%

Research-level math problems

FrontierMath Tier 412.5%

Hardest research math

Agentic & Tools

2 evals
APEX17.5%

Multi-step agentic tasks

Terminal-Bench47.6%

Command-line agentic tasks

In the news

6

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.