Disaggregated Inference

Concept

NVIDIA's fundamental technology for its AI factory, involving breaking down the inference pipeline to run on different GPUs for efficiency.

Mentioned in 1 video