InstructGPT

Software / App

GPT by OpenAI as language model

Mentioned in 5 videos

Save the 5 videos on InstructGPT to your own pod.

Get Started Free

Videos Mentioning InstructGPT

The REAL potential of generative AI

Y Combinator

A model developed by OpenAI that, despite being smaller, showed significant preferred performance over larger models when instruction-tuned and using Reinforcement Learning from Human Feedback (RLHF).

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert

Latent Space

An OpenAI model that demonstrated the three-step RLHF process and produced 'incredibly pretty plots' of performance improvement. It tried to match the instruction tuning model to constrain the distribution.

Beating GPT-4 with Open Source Models - with Michael Royzen of Phind

Latent Space

An earlier version of the GPT models fine-tuned to follow instructions, mentioned as a predecessor to advanced instruction-tuned models.

Alexandr Wang: Building Scale AI, Transforming Work With Agents & Competing With China

Y Combinator

A precursor to ChatGPT developed with OpenAI, which became a significant turning point for Scale AI.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training

Stanford Online

Its appendix provides a glimpse into industry data collection processes for RLHF, detailing guidelines for rating outputs on helpfulness, truthfulness, and harmlessness.