Quantization

Concept

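A technique that reduces the numerical precision of model weights (for example, from 32-bit floating point to 8-bit integers), yielding smaller models that typically run faster, usually at the cost of a small loss in accuracy.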
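As a rough illustration of what reducing weight precision can look like, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. This is one common scheme among several, not necessarily the one discussed in the video; the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical and chosen only for this example.

```python
def quantize_int8(weights):
    """Map floats to integers in [-127, 127] using one shared scale factor."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero input, where any positive scale would work.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the integers and the scale."""
    return [v * scale for v in q]


# Hypothetical weights, for illustration only.
weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Each restored weight differs from the original by at most half a step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(q, scale, max_error)
```

Each quantized value fits in one byte instead of the four bytes of a 32-bit float, which is where the size reduction comes from. The rounding error per weight is bounded by half of `scale`, so very small weights (such as `0.003` here) can collapse to zero.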

Mentioned in 1 video