G

GELU

Tool / Product · Mentioned in 1 video

GELU (Gaussian Error Linear Unit) is a nonlinearity (activation function) used in the multilayer perceptron (MLP) / feed-forward block. It is noted here because OpenAI uses it.
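To make the definition concrete, here is a minimal sketch of GELU in plain Python. It shows the exact form, x · Φ(x) with Φ the standard normal CDF, next to the widely used tanh approximation. The function names `gelu` and `gelu_tanh` are illustrative choices, not taken from the source, and the source does not say which of the two variants it has in mind.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU (assumed variant, shown for comparison)."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


if __name__ == "__main__":
    # Compare the two forms at a few sample inputs.
    for value in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(f"x={value:+.1f}  exact={gelu(value):+.6f}  tanh={gelu_tanh(value):+.6f}")
```

Unlike ReLU, which zeroes every negative input, GELU lets small negative inputs through with a small negative output and approaches the identity for large positive inputs.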