Winner's curse
Concept · Mentioned in 1 video
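A phenomenon where the measured performance of a selected model is overstated because of statistical fluctuations: when the top scorer is picked from many candidates, the winner tends to be one whose score was inflated by noise. LMSys's live benchmark addresses this concern.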
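The effect can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not a description of how LMSys evaluates models: the function name `simulate_winners_curse` and all parameter values (20 models, a shared true accuracy of 0.70, 200 test examples) are hypothetical and chosen only for illustration. Every simulated model is equally good, so any gap between the winner's score and the true accuracy comes purely from selecting on noise.

```python
import random


def simulate_winners_curse(num_models=20, true_accuracy=0.70,
                           num_examples=200, num_trials=500, seed=0):
    """Compare the winner's benchmark score with a fresh re-evaluation.

    All models share the same true accuracy (an illustrative assumption),
    so the winner is chosen by luck alone.
    """
    rng = random.Random(seed)

    def observed_accuracy():
        # Score one model on a finite test set: each example is answered
        # correctly with probability true_accuracy.
        correct = sum(rng.random() < true_accuracy for _ in range(num_examples))
        return correct / num_examples

    winner_scores = []
    fresh_scores = []
    for _ in range(num_trials):
        # Evaluate every candidate once and keep the best observed score.
        scores = [observed_accuracy() for _ in range(num_models)]
        winner_scores.append(max(scores))
        # Re-evaluate the winner on new data. Since the models are
        # identical, this is simply one more independent evaluation.
        fresh_scores.append(observed_accuracy())

    mean_winner = sum(winner_scores) / num_trials
    mean_fresh = sum(fresh_scores) / num_trials
    return mean_winner, mean_fresh


if __name__ == "__main__":
    mean_winner, mean_fresh = simulate_winners_curse()
    print(f"mean score of selected winner: {mean_winner:.3f}")
    print(f"mean score on fresh data:      {mean_fresh:.3f}")
```

In this sketch the winner's average score sits above the true accuracy, while its score on fresh data falls back toward it. Continually evaluating on new data, as a live benchmark does, is one way to expose that gap.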