O

Open LLM Leaderboards

Tool / ProductMentioned in 1 video

A benchmark previously run by Hugging Face, now being phased out in favor of agentic evaluations.