💡 : If you are looking for the software/machine learning paper, search for "QLoRA" or "4-bit NormalFloat" on arXiv .
: A process that quantizes the quantization constants themselves to save additional memory. NF4.rar
: An information-theoretically optimal data type for normally distributed weights. It uses 16 quantization levels based on the quantiles of a standard normal distribution. 💡 : If you are looking for the
The paper explains why NF4 is superior to standard 4-bit integers (Int4) or floating-point (Float4) formats: NF4.rar