Pipeline Parallelism

Concept

A distributed training technique where different stages (layers) of the model are placed on different devices, processed sequentially with micro-batches.

Mentioned in 1 video