Epoch confirms GPT5.4 Pro solved a Frontier Math Open Problem for the first time

(epoch.ai)

23 points | by in-silico 1 hour ago

1 comments

6thbit 33 minutes ago
> Subsequent to this solve, we finished developing our general scaffold for testing models on FrontierMath: Open Problems. In this scaffold, several other models were able to solve the problem as well: Opus 4.6 (max), Gemini 3.1 Pro, and GPT-5.4 (xhigh).
Interesting. Whats that “scaffold”? A sort of unit test framework for proofs?