Simple Bench
Software / App
A fast LLM benchmark focusing on trick questions and common sense reasoning; used to compare model progress.
Mentioned in 1 video
A fast LLM benchmark focusing on trick questions and common sense reasoning; used to compare model progress.