Open LLM Leaderboards

Software / AppMentioned in 1 video

A benchmark previously run by Hugging Face, now being phased out in favor of agentic evaluations.