Open LLM Leaderboards
Software / App
A benchmark previously run by Hugging Face, now being phased out in favor of agentic evaluations.
Mentioned in 1 video
A benchmark previously run by Hugging Face, now being phased out in favor of agentic evaluations.