GSM8K

Study / Research

A benchmark of grade-school math word problems used to evaluate AI mathematical reasoning, later found to contain errors in its original dataset.
