LLM Arena

Software / App

A platform for evaluating language models, cited as an example of an academic KPI that may not correlate directly with how useful a model is to its users.

Mentioned in 3 videos