R

Reinforcement Learning Human Feedback (RLHF)

ConceptMentioned in 1 video

A post-training technique where AI refines its skills based on human feedback, akin to having a mentor or coach.