S

Simple Bench

Tool / Product

A fast LLM benchmark focusing on trick questions and common sense reasoning; used to compare model progress.

Mentioned in 1 video