A
Adam optimizer
Tool / ProductMentioned in 2 videos
The optimization algorithm chosen for training in the lecture (recommended default for training transformers).
The optimization algorithm chosen for training in the lecture (recommended default for training transformers).