dual chunk attention

Concept

An optimization technique employed in Qwen 3's long context stage to efficiently process sequences.

Mentioned in 1 video