R
Reinforcement Learning Human Feedback (RLHF)
ConceptMentioned in 1 video
A post-training technique where AI refines its skills based on human feedback, akin to having a mentor or coach.
A post-training technique where AI refines its skills based on human feedback, akin to having a mentor or coach.