RoPE

ConceptMentioned in 2 videos

A position embedding method used in Transformer architectures, appreciated for its extrapolation properties in long context models.