GELU

ConceptMentioned in 1 video

A nonlinearity used in the Multilayer Perceptron (MLP) / feed-forward block (noted because OpenAI uses it).