LayerNorm
ConceptMentioned in 1 video
Layer normalization used in the Transformer; the lecture implements pre-norm LayerNorm to stabilize deep network training.
Layer normalization used in the Transformer; the lecture implements pre-norm LayerNorm to stabilize deep network training.