NF4

Concept

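A 4-bit NormalFloat (NF4) quantization format used by QLoRA. Each model weight is stored as one of 16 levels chosen to suit normally distributed values, which cuts weight memory to roughly a quarter of 16-bit storage.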
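To make the idea concrete, here is a minimal Python sketch of block-wise 4-bit quantization against a 16-entry lookup table. The level values are the commonly published NF4 constants rounded to four decimals, so treat them as illustrative. The function names (`quantize_block`, `dequantize_block`, `pack`) are hypothetical and are not any library's API, and real implementations add details this sketch omits, such as fixed block sizes and quantizing the scale constants themselves.

```python
# Illustrative NF4-style codebook: 16 levels in [-1, 1], denser near zero.
# Values are rounded to 4 decimals (an assumption made for readability).
NF4_LEVELS = [
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
    0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
]


def quantize_block(weights):
    """Return (absmax scale, 4-bit codes) for one block of float weights."""
    # Scale the block into [-1, 1]; fall back to 1.0 for an all-zero block.
    absmax = max(abs(w) for w in weights) or 1.0
    codes = []
    for w in weights:
        x = w / absmax
        # Pick the index of the nearest codebook level.
        codes.append(min(range(16), key=lambda i: abs(NF4_LEVELS[i] - x)))
    return absmax, codes


def dequantize_block(absmax, codes):
    """Map 4-bit codes back to approximate float weights."""
    return [NF4_LEVELS[c] * absmax for c in codes]


def pack(codes):
    """Pack two 4-bit codes per byte (pads with code 0 if the count is odd)."""
    if len(codes) % 2:
        codes = codes + [0]
    return bytes((hi << 4) | lo for hi, lo in zip(codes[0::2], codes[1::2]))


weights = [0.5, -1.0, 0.0, 2.0]
scale, codes = quantize_block(weights)
packed = pack(codes)
restored = dequantize_block(scale, codes)
```

In this example four weights occupy two bytes plus one scale value per block, and the restored values only approximate the originals, which is the trade-off the format makes for the memory saving.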
