Alpaca
Software / App
A language model often improved further with DPO. Its name also appears in AlpacaEval, a popular academic benchmark for evaluating chat capabilities that compares a candidate model's responses against those of text-davinci-003.
Mentioned in 2 videos
Videos Mentioning Alpaca

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert
Latent Space

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 15: Mid/Post-Training
Stanford Online
Developed by Stanford researchers, Alpaca distilled outputs from an OpenAI model (text-davinci-003) into instruction-response pairs, demonstrating that such chat-style data could effectively train ChatGPT-like systems when used to fine-tune models like LLaMA.