Chatbot Arena
A leaderboard / ranking site for comparing chat models (mentioned as a way to track models).
Save the 4 videos on Chatbot Arena to your own pod.
Sign up free to keep building your knowledge base on Chatbot Arena as more episodes are added.
Videos Mentioning Chatbot Arena

How I use LLMs
Andrej Karpathy
A leaderboard / ranking site for comparing chat models (mentioned as a way to track models).

In the Arena: How LMSys changed LLM Benchmarking Forever
Latent Space
A platform by LMSys for crowdsourced LLM benchmarking where users compare anonymized models side-by-side.

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert
Latent Space
A platform by LM Cys for limited evaluation of language models, valuable for understanding user interaction, and showed GPT-4 Turbo's superior performance.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 12: Evaluation
Stanford Online
A platform for evaluating language models through pairwise comparisons rated by humans, formerly known as Chapot Arena. It uses Elo rankings to determine model performance.