Adaptive Layer Norm
Concept
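A method for injecting external signals (timestep, class label) into the Diffusion Transformer (DiT). Learned scale and shift coefficients, computed from the conditioning signal, modulate the layer-normalized patch tokens, and a learned gate scales each residual branch. Of the conditioning mechanisms compared for DiT, it was found to be the most performant.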
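As a rough illustration of the gate, scale, and shift modulation described above, here is a minimal pure-Python sketch for a single token. It is not taken from any reference implementation: the function names (`layer_norm`, `linear`, `adaln_block`), the single linear projection from the conditioning vector, and the `1 + scale` form are assumptions made for the example.

```python
import math

def layer_norm(x, eps=1e-6):
    """Normalize one token vector to zero mean and unit variance (no learned affine)."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def linear(w, b, c):
    """Hypothetical projection: each row of w dotted with c, plus a bias."""
    return [sum(wi * ci for wi, ci in zip(row, c)) + bi for row, bi in zip(w, b)]

def adaln_block(x, c, w, b, sublayer):
    """Apply one adaptively normalized residual branch to a single token.

    x: token vector of width d
    c: conditioning vector (e.g. timestep embedding plus class embedding)
    w, b: assumed projection producing 3*d values: shift, scale, gate
    sublayer: any function on a token vector (stands in for attention or an MLP)
    """
    d = len(x)
    params = linear(w, b, c)
    shift, scale, gate = params[:d], params[d:2 * d], params[2 * d:]
    # Scale and shift the normalized token using the conditioning signal.
    h = [n * (1.0 + s) + t for n, s, t in zip(layer_norm(x), scale, shift)]
    out = sublayer(h)
    # The gate controls how much of the branch output enters the residual stream.
    return [xi + g * oi for xi, g, oi in zip(x, gate, out)]
```

With `w` and `b` set to all zeros, the gate is zero and the block returns its input unchanged. This corresponds to the zero-initialized variant, in which every block starts out as the identity function.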
