DeepSpeed

Software / App

Mentioned as a system from Microsoft that uses data parallelism, contrasted with Cerebras' weight streaming approach.

Mentioned in 3 videos