Scores on standardized evaluations. Higher is better — and rank shows where DeepSeek-V3.1 lands among all models tracked on Model Beat.
Common-sense trick questions
Novel ML problem-solving
Model & benchmark data from Epoch AI (CC BY); pricing, specs & descriptions from OpenRouter.