RoPE

Concept

A position embedding method used in Transformer architectures, appreciated for its extrapolation properties in long context models.

Mentioned in 2 videos