path: root/ggml/src/iqk/iqk_quantize.cpp
Age         Commit message                                   Author
2024-08-09  iq6_k: WIP (quantize/dequantize)                 Iwan Kawrakow
2024-08-09  iq6_k: WIP (nothing works)                       Iwan Kawrakow
2024-08-07  Adding IQ2_TN for use with ternary models (#13)  Kawrakow
2024-08-05  iq3_k, iq5_k: faster quantization                Iwan Kawrakow
2024-08-03  iq4_k: speedup quantization by a factor of ~2    Iwan Kawrakow
2024-08-01  iq3_k: Basics                                    Iwan Kawrakow
2024-08-01  iq5_k: Basics                                    Iwan Kawrakow
2024-08-01  iq2_k: Basics                                    Iwan Kawrakow
2024-07-28  IQ4_K: SOTA 4-bit quantization (#6)              Kawrakow
2024-07-27  Merge mainline llama.cpp (#3)                    Kawrakow