OpenAI: GPT-5.1

Name: GPT-5.1
Author: OpenAI

openai/gpt-5.1

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5.

Context

400K

Input

$1.25 / 1M

Output

$10 / 1M

Released

Nov 13, 2025

Modalities

imagetextfile→text

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where GPT-5.1 lands among all models tracked on Model Beat.

48.0

Intelligence Index

Epoch AI

48th percentile of tracked models

35.0

Coding Index

Epoch AI

35th percentile of tracked models

50.0

Agentic Index

Epoch AI

50th percentile of tracked models

Reasoning

7 evals

ARC-AGI72.8%

Abstract visual reasoning

ARC-AGI-217.6%

Harder abstract reasoning

GPQA Diamond87.6%

Graduate-level scientific reasoning

Humanity's Last Exam23.7%

Frontier of human expert knowledge

SimpleBench53.2%

Common-sense trick questions

SimpleQA Verified48.9%

Factual accuracy & hallucination

WeirdML60.8%

Novel ML problem-solving

Coding

3 evals

GSO (code optimization)13.7%

Code performance optimization

SWE-bench Verified68.0%

Real GitHub issue resolution

WebDev Arena1387

Human-rated web development

Math

3 evals

AIME 2024/202588.6%

Olympiad-qualifier math

FrontierMath31.0%

Research-level math problems

FrontierMath Tier 412.5%

Hardest research math

Agentic & Tools

2 evals

APEX17.5%

Multi-step agentic tasks

Terminal-Bench47.6%

Command-line agentic tasks

In the news

How Tolan builds voice-first AI with GPT-5.1OpenAI Blog · Jan 7, 2026
Building more with GPT-5.1-Codex-MaxOpenAI Blog · Nov 19, 2025
GPT-5.1-Codex-Max System CardOpenAI Blog · Nov 19, 2025
Introducing GPT-5.1 for developersOpenAI Blog · Nov 13, 2025
GPT-5.1 Instant and GPT-5.1 Thinking System Card AddendumOpenAI Blog · Nov 12, 2025
GPT-5.1: A smarter, more conversational ChatGPTOpenAI Blog · Nov 12, 2025

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.