Winner's curse

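A selection effect in which the measured performance of a model chosen as the best of several candidates overstates its true performance. Every benchmark score carries statistical noise, so the top-scoring model tends to be the one whose score fluctuated upward, and its winning score is biased high. LMSys's live benchmark addresses this concern.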
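
The effect can be illustrated with a small simulation. The sketch below is not taken from the source or from LMSys; the function name `simulate_winners_curse` and all parameter values are hypothetical, chosen only for illustration. It assumes every candidate model has the same true accuracy, scores each one on a finite benchmark, picks the top scorer, and then re-scores that winner on fresh items.

```python
import random


def simulate_winners_curse(n_models=20, true_accuracy=0.70,
                           n_items=200, n_trials=500, seed=0):
    """Return (mean winning score, mean score of the winner on fresh items).

    Assumption for illustration: all models are equally good, so any
    difference between their benchmark scores is pure noise.
    """
    rng = random.Random(seed)

    def noisy_score():
        # Fraction of n_items answered correctly by a model whose
        # true accuracy is `true_accuracy`.
        correct = sum(rng.random() < true_accuracy for _ in range(n_items))
        return correct / n_items

    selected_total = 0.0
    fresh_total = 0.0
    for _ in range(n_trials):
        # Score every candidate once and keep the best observed score.
        selected_total += max(noisy_score() for _ in range(n_models))
        # Re-evaluate the winner on new items; since all models share the
        # same true accuracy, this is just another independent noisy score.
        fresh_total += noisy_score()

    return selected_total / n_trials, fresh_total / n_trials


if __name__ == "__main__":
    mean_selected, mean_fresh = simulate_winners_curse()
    print(f"mean score of the selected model: {mean_selected:.3f}")
    print(f"mean score on fresh items:        {mean_fresh:.3f}")
```

Under these assumptions the winning score sits noticeably above the true accuracy, while the fresh score does not. Re-scoring on data that played no part in the selection is one generic way to get an unbiased estimate; it is shown here only to expose the bias, not as a description of how LMSys's benchmark works.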