Tensor Parallelism

ConceptMentioned in 1 video

A distributed training technique that splits model tensors across multiple devices.