L

LayerNorm

Tool / ProductMentioned in 1 video

Layer normalization used in the Transformer; the lecture implements pre-norm LayerNorm to stabilize deep network training.