Chatbot Arena
Software / AppMentioned in 3 videos
A leaderboard / ranking site for comparing chat models (mentioned as a way to track models).
Videos Mentioning Chatbot Arena

How I use LLMs
Andrej Karpathy
A leaderboard / ranking site for comparing chat models (mentioned as a way to track models).

In the Arena: How LMSys changed LLM Benchmarking Forever
Latent Space
A platform by LMSys for crowdsourced LLM benchmarking where users compare anonymized models side-by-side.

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert
Latent Space
A platform by LM Cys for limited evaluation of language models, valuable for understanding user interaction, and showed GPT-4 Turbo's superior performance.