NF4
Concept
A 4-bit Normal Float quantization format used by Q-LoRA, significantly reducing memory usage for model weights.
Mentioned in 1 video
A 4-bit Normal Float quantization format used by Q-LoRA, significantly reducing memory usage for model weights.