Babbitt Task

Software / App

A toy problem proposed to test AI reasoning and working memory capabilities, considered a useful benchmark.

Mentioned in 1 video