ModelBeat

OpenAI: GPT-5.2

openai/gpt-5.2

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long-context performance than GPT-5.1.

Context: 400K
Input: $1.75 / 1M
Output: $14 / 1M
Released: Dec 11, 2025
Modalities: file, image, text (input); text (output)
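
The listed prices translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: it assumes "1M" means one million tokens, that prices are in US dollars, and that billing is linear in token count with no caching discounts or other adjustments. The function name `estimate_cost_usd` is hypothetical and not part of any API.

```python
# Prices copied from the listing, assumed to be USD per 1M tokens.
INPUT_USD_PER_M = 1.75
OUTPUT_USD_PER_M = 14.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 100,000-token prompt with a 5,000-token reply.
cost = estimate_cost_usd(100_000, 5_000)
print(f"${cost:.3f}")
```

Under these assumptions, output tokens cost eight times as much as input tokens, so long generations can dominate the bill even when the prompt is large.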

Benchmarks

Scores on standardized evaluations. Higher is better, and the percentile rank shows where GPT-5.2 lands among all models tracked on Model Beat.

Intelligence Index (Epoch AI): 74.0
74th percentile of tracked models

Coding Index (Epoch AI): 65.0
65th percentile of tracked models

Agentic Index (Epoch AI): 88.0
88th percentile of tracked models

Reasoning

7 evals
ARC-AGI: 86.2%
Abstract visual reasoning

ARC-AGI-2: 52.9%
Harder abstract reasoning

GPQA Diamond: 91.4%
Graduate-level scientific reasoning

Humanity's Last Exam: 27.8%
Frontier of human expert knowledge

SimpleBench: 45.8%
Common-sense trick questions

SimpleQA Verified: 38.9%
Factual accuracy & hallucination

WeirdML: 72.2%
Novel ML problem-solving

Coding

3 evals
GSO (code optimization): 27.4%
Code performance optimization

SWE-bench Verified: 73.8%
Real GitHub issue resolution

WebDev Arena: 1480
Human-rated web development

Math

3 evals
AIME 2024/2025: 96.1%
Olympiad-qualifier math

FrontierMath: 40.7%
Research-level math problems

FrontierMath Tier 4: 18.8%
Hardest research math

Agentic & Tools

4 evals
APEX: 34.3%
Multi-step agentic tasks

GDPval (win/tie rate): 70.9%
Economically valuable work

METR task horizon: 5.9 h
Autonomous task length

Terminal-Bench: 64.9%
Command-line agentic tasks

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.