Babbitt Task
Software / App
A toy problem proposed to test AI reasoning and working memory capabilities, considered a useful benchmark.
Mentioned in 1 video
A toy problem proposed to test AI reasoning and working memory capabilities, considered a useful benchmark.