Lead author of GPQA; notes on potential noise in benchmark data and training influence.
Mentioned in 1 video
AI Explained