Adam

Software / App

A popular optimization algorithm for training deep learning models, mentioned in the context of hyperparameter tuning.

Mentioned in 4 videos

Save the 4 videos on Adam to your own pod.

Get Started Free

Videos Mentioning Adam

Jeremy Howard: fast.ai Deep Learning Courses and Research | Lex Fridman Podcast #35

Lex Fridman

A popular optimization algorithm for training deep learning models, mentioned in the context of hyperparameter tuning.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 2: PyTorch (einops)

Stanford Online

An optimization algorithm that combines momentum and adaptive learning rates, mentioned in contrast to Adagrad and relevant for Assignment 1.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 8: Parallelism

Stanford Online

An optimization algorithm that requires tracking first and second moments of gradients, leading to significant memory overhead.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 9: Scaling Laws

Stanford Online

A common optimization algorithm, discussed in relation to SGD and scaling laws, showing that different optimizers often have similar scaling behaviors.