Paul Christiano is identified as the inventor of Reinforcement Learning from Human Feedback (RLHF).
Y Combinator