moonshotai/kimi-k2.5
Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm.
Scores on standardized evaluations. Higher is better, and the rank shows where Kimi K2.5 places among all models tracked on Model Beat.
Abstract visual reasoning
Harder abstract reasoning
Graduate-level scientific reasoning
Frontier of human expert knowledge
Common-sense trick questions
Factual accuracy & hallucination
Novel ML problem-solving
Real GitHub issue resolution
Human-rated web development
Olympiad-qualifier math
Research-level math problems
Hardest research math
Multi-step agentic tasks
Command-line agentic tasks
Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.