Open LLM Leaderboards

Software / App

A benchmark previously run by Hugging Face, now being phased out in favor of agentic evaluations.

Mentioned in 1 video