ModelBeat
Models/Grok 4
All models
X
grok-4

Grok 4 is a Grok-family AI model, from xAI, released Jul 9, 2025.

Context
Input
/ 1M
Output
/ 1M
Released
Jul 9, 2025
Modalities

Benchmarks

Scores on standardized evaluations. Higher is better — and rank shows where Grok 4 lands among all models tracked on Model Beat.

26.0
Intelligence Index
Epoch AI
26th percentile of tracked models
15.0
Coding Index
Epoch AI
15th percentile of tracked models
16.0
Agentic Index
Epoch AI
16th percentile of tracked models

Reasoning

6 evals
ARC-AGI66.7%

Abstract visual reasoning

ARC-AGI-216.0%

Harder abstract reasoning

GPQA Diamond87.0%

Graduate-level scientific reasoning

SimpleBench60.5%

Common-sense trick questions

SimpleQA Verified47.9%

Factual accuracy & hallucination

WeirdML45.7%

Novel ML problem-solving

Math

3 evals
AIME 2024/202584.0%

Olympiad-qualifier math

FrontierMath19.7%

Research-level math problems

FrontierMath Tier 42.1%

Hardest research math

Agentic & Tools

4 evals
APEX15.2%

Multi-step agentic tasks

GDPval (win/tie rate)24.3%

Economically valuable work

METR task horizon1.8 h

Autonomous task length

Terminal-Bench27.2%

Command-line agentic tasks

Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.