InstructGPT
A GPT language model by OpenAI
Videos Mentioning InstructGPT

The REAL potential of generative AI
Y Combinator
A model developed by OpenAI that, despite being smaller, was significantly preferred over larger models once it was instruction-tuned and trained with Reinforcement Learning from Human Feedback (RLHF).

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert
Latent Space
An OpenAI model that demonstrated the three-step RLHF process and produced 'incredibly pretty plots' of performance improvement. Its training tried to stay close to the instruction-tuned model in order to constrain the output distribution.

Beating GPT-4 with Open Source Models - with Michael Royzen of Phind
Latent Space
An earlier version of the GPT models fine-tuned to follow instructions, mentioned as a predecessor to advanced instruction-tuned models.

Alexandr Wang: Building Scale AI, Transforming Work With Agents & Competing With China
Y Combinator
A precursor to ChatGPT that Scale AI worked on with OpenAI; the project became a significant turning point for Scale AI.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training
Stanford Online
The appendix of the InstructGPT paper provides a glimpse into industry data collection processes for RLHF, detailing guidelines for rating outputs on helpfulness, truthfulness, and harmlessness.