GELU

Concept

A nonlinearity used in the Multilayer Perceptron (MLP) / feed-forward block (noted because OpenAI uses it).

Mentioned in 1 video