Adam
A popular optimization algorithm for training deep learning models, mentioned in the context of hyperparameter tuning.
Videos Mentioning Adam

Jeremy Howard: fast.ai Deep Learning Courses and Research | Lex Fridman Podcast #35
Lex Fridman
A popular optimization algorithm for training deep learning models, mentioned in the context of hyperparameter tuning.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 2: PyTorch (einops)
Stanford Online
An optimization algorithm that combines momentum and adaptive learning rates, mentioned in contrast to Adagrad and relevant for Assignment 1.
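
As a rough illustration of how momentum and adaptive learning rates combine, here is a minimal sketch of one Adam update in plain Python. It is not taken from the lecture or the assignment: the function name `adam_step`, the list-of-floats representation, and the toy objective f(x) = x^2 are assumptions made for this example. The default hyperparameters are the commonly used values (lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8).

```python
import math

def adam_step(params, grads, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update over lists of floats (hypothetical helper).

    t is the 1-based step count. Returns the new (params, m, v).
    """
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        # First moment: exponential moving average of gradients (momentum).
        m_i = beta1 * m_i + (1 - beta1) * g
        # Second moment: exponential moving average of squared gradients.
        v_i = beta2 * v_i + (1 - beta2) * g * g
        # Bias correction, since both averages start at zero.
        m_hat = m_i / (1 - beta1 ** t)
        v_hat = v_i / (1 - beta2 ** t)
        # Per-parameter step size: scaled by the root of the second moment.
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v

# Toy usage (assumed objective): minimize f(x) = x^2 starting from x = 1.0.
x, m, v = [1.0], [0.0], [0.0]
for t in range(1, 2001):
    grad = [2 * x[0]]  # gradient of x^2
    x, m, v = adam_step(x, grad, m, v, t, lr=0.01)
```

Dropping the first-moment average and accumulating a plain running sum of squared gradients, rather than a moving average, would give an Adagrad-style update, which is the contrast the description refers to.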

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 8: Parallelism
Stanford Online
An optimization algorithm that requires tracking first and second moments of gradients, leading to significant memory overhead.
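
To make the memory overhead concrete, the sketch below estimates the size of Adam's optimizer state. It assumes one first-moment value and one second-moment value per parameter, each stored in 32-bit floats. The precision, the helper name `adam_state_bytes`, and the 7-billion-parameter model size are assumptions for illustration, not figures from the lecture.

```python
def adam_state_bytes(num_params, bytes_per_value=4):
    """Bytes needed for Adam's two moment buffers (hypothetical helper).

    Assumes one first-moment and one second-moment value per parameter,
    each taking bytes_per_value bytes (4 for fp32).
    """
    return 2 * num_params * bytes_per_value

# Hypothetical model with 7 billion parameters and fp32 optimizer state.
num_params = 7_000_000_000
state_gb = adam_state_bytes(num_params) / 1e9
print(f"Adam state: {state_gb:.0f} GB")
```

Under these assumptions the optimizer state alone is twice the size of an fp32 copy of the weights, which is why it is often sharded across devices in parallel training.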

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 9: Scaling Laws
Stanford Online
A common optimization algorithm, discussed in relation to SGD and scaling laws, showing that different optimizers often have similar scaling behaviors.