GSM8K
Study / Research
A benchmark of grade-school math word problems (GSM8K stands for Grade School Math 8K) used to test AI mathematical reasoning, found to have errors in its original design.
Mentioned in 1 video