ModelBeat
Models/gpt-oss-120b
All models
O

OpenAI: gpt-oss-120b

openai/gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases.

Context
131K
Input
$0.04 / 1M
Output
$0.18 / 1M
Released
Aug 5, 2025
Modalities
texttext

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where gpt-oss-120b lands among all models tracked on Model Beat.

19.0
Intelligence Index
Epoch AI
19th percentile of tracked models
5.0
Coding Index
Epoch AI
5th percentile of tracked models
6.0
Agentic Index
Epoch AI
6th percentile of tracked models

Reasoning

4 evals
GPQA Diamond75.8%

Graduate-level scientific reasoning

SimpleBench22.1%

Common-sense trick questions

SimpleQA Verified13.9%

Factual accuracy & hallucination

WeirdML48.2%

Novel ML problem-solving

Math

1 evals
AIME 2024/202588.9%

Olympiad-qualifier math

Agentic & Tools

3 evals
APEX4.7%

Multi-step agentic tasks

METR task horizon42 min

Autonomous task length

Terminal-Bench18.7%

Command-line agentic tasks

In the news

2

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.