ELO scores
Concept
Used as chess ratings to benchmark AI models and compare their performance.
Mentioned in 2 videos
Videos Mentioning ELO scores

In the Arena: How LMSys changed LLM Benchmarking Forever
Latent Space
A system for ranking models that Chatbot Arena uses, considered a revolution in LLM benchmarking.

Scaling and the Road to Human-Level AI | Anthropic Co-founder Jared Kaplan
Y Combinator
Used as chess ratings to benchmark AI models and compare their performance.