Quantization

ConceptMentioned in 1 video

A technique to reduce the precision of model weights, leading to smaller and faster models.