Age | Commit message (Expand) | Author |
---|---|---|
2024-01-17 | ggml : add IQ2 to test-backend-ops + refactoring (#4990) | Georgi Gerganov |
2024-01-16 | ggml : importance matrix support for legacy quants (#4969) | Kawrakow |
2024-01-14 | Add ability to use importance matrix for all k-quants (#4930) | Kawrakow |
2024-01-14 | 2-bit quantizations (#4897) | Kawrakow |
2024-01-11 | ggml : SOTA 2-bit quants (add IQ2_XS) (#4856) | Kawrakow |
2024-01-08 | SOTA 2-bit quants (#4773) | Kawrakow |
2024-01-05 | ggml : fix q2_k bpw in comments (ggml/680) | Georgi Gerganov |
2023-10-30 | ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861) | Georgi Gerganov |
2023-10-29 | ggml : quantization refactoring (#3833) | Georgi Gerganov |