Ring Attention

Concept

A parallelism strategy specifically designed to efficiently partition large sequences, particularly in attention mechanisms.

Mentioned in 1 video