RLHF

Concept

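Reinforcement Learning from Human Feedback (RLHF), a training paradigm in which a model is fine-tuned with reinforcement learning against a reward signal derived from human preference judgments. Mentioned as a way to go beyond human performance.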

Mentioned in 1 video