L
LayerNorm
Tool / ProductMentioned in 1 video
Layer normalization used in the Transformer; the lecture implements pre-norm LayerNorm to stabilize deep network training.
Layer normalization used in the Transformer; the lecture implements pre-norm LayerNorm to stabilize deep network training.