CPU DRAM

Product

Host memory used to hold the KV cache once GPU memory runs out, which makes it important for large-scale inference systems.
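
The spill-over behavior described above can be illustrated with a toy two-tier cache. This is only a sketch under stated assumptions: the class name `TwoTierKVCache`, the block-based layout, and the least-recently-used eviction policy are all hypothetical choices made for illustration, and both tiers are ordinary Python dictionaries rather than real GPU or host memory.

```python
from collections import OrderedDict


class TwoTierKVCache:
    """Toy KV cache with a small 'GPU' tier that spills into a 'CPU DRAM' tier.

    Hypothetical illustration: both tiers are plain Python containers,
    so no device memory or data transfer is actually involved.
    """

    def __init__(self, gpu_capacity_blocks: int):
        self.gpu_capacity_blocks = gpu_capacity_blocks
        self.gpu = OrderedDict()  # block_id -> KV data, least recently used first
        self.cpu = {}             # block_id -> KV data offloaded to host memory

    def put(self, block_id, kv):
        """Place a block in the GPU tier, offloading old blocks to CPU if full."""
        self.cpu.pop(block_id, None)      # drop any stale offloaded copy
        self.gpu[block_id] = kv
        self.gpu.move_to_end(block_id)    # mark as most recently used
        while len(self.gpu) > self.gpu_capacity_blocks:
            old_id, old_kv = self.gpu.popitem(last=False)  # evict LRU block
            self.cpu[old_id] = old_kv

    def get(self, block_id):
        """Return a block, pulling it back from the CPU tier if needed."""
        if block_id in self.gpu:
            self.gpu.move_to_end(block_id)
            return self.gpu[block_id]
        if block_id in self.cpu:
            kv = self.cpu.pop(block_id)
            self.put(block_id, kv)        # may offload another block in turn
            return kv
        return None                       # block was never cached


cache = TwoTierKVCache(gpu_capacity_blocks=2)
for block_id, kv in [("a", 1), ("b", 2), ("c", 3)]:
    cache.put(block_id, kv)
print(list(cache.gpu), cache.cpu)
```

In a real inference system the offloaded blocks would be copied over the host-device interconnect, so reads from the CPU tier are slower than reads from GPU memory; the sketch ignores that cost and only shows the bookkeeping.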

Mentioned in 1 video