| Age | Commit message (Expand) | Author |
|---|---|---|
| 2024-02-18 | 1.5 bit quantization (#5453) | Kawrakow |
| 2024-02-11 | ggml : add mmla kernels for quantized GEMM (#4966) | snadampal |
| 2024-02-05 | ggml : make use of ggml-quants.h possible in C++ code (#5338) | Kawrakow |
| 2024-01-30 | SOTA 3-bit quants (#5196) | Kawrakow |
| 2024-01-17 | ggml : add IQ2 to test-backend-ops + refactoring (#4990) | Georgi Gerganov |
| 2024-01-16 | ggml : importance matrix support for legacy quants (#4969) | Kawrakow |
| 2024-01-14 | Add ability to use importance matrix for all k-quants (#4930) | Kawrakow |
| 2024-01-14 | 2-bit quantizations (#4897) | Kawrakow |
| 2024-01-11 | ggml : SOTA 2-bit quants (add IQ2_XS) (#4856) | Kawrakow |
| 2024-01-08 | SOTA 2-bit quants (#4773) | Kawrakow |
| 2024-01-05 | ggml : fix q2_k bpw in comments (ggml/680) | Georgi Gerganov |
| 2023-10-30 | ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861) | Georgi Gerganov |
| 2023-10-29 | ggml : quantization refactoring (#3833) | Georgi Gerganov |
