Epoch AI
Organization
Mentioned in 1 video
Creator of the FrontierMath benchmark, on which OpenAI's o3 scored around 25%, and noted for exposing potential LLM scheming.