LM as a Judge

Concept

A methodology explored by LMSys to use LLMs themselves to judge or evaluate other LLMs, aiming for automated high-quality signals.

Mentioned in 1 video