path: root/tests/test-quantize-fns.cpp
Age         Commit message                                                                    Author
2024-07-27  Merge mainline llama.cpp (#3)                                                     Kawrakow
2024-06-22  bitnet: qnfs tests                                                                Iwan Kawrakow
2024-03-25  tests : include IQ2_XXS and IQ2_XS in test-quantize-fns (#6303)                   Kawrakow
2024-02-26  Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range...  Kawrakow
2024-02-24  IQ3_S: a much better alternative to Q3_K (#5676)                                  Kawrakow
2024-02-11  ggml : add mmla kernels for quantized GEMM (#4966)                                snadampal
2024-01-30  SOTA 3-bit quants (#5196)                                                         Kawrakow
2024-01-11  ggml : SOTA 2-bit quants (add IQ2_XS) (#4856)                                     Kawrakow
2024-01-08  SOTA 2-bit quants (#4773)                                                         Kawrakow
2023-10-30  ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861)                             Georgi Gerganov
2023-09-15  check C++ code with -Wmissing-declarations (#3184)                                Cebtenzzre
2023-07-05  ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)                Stephan Walter
2023-06-16  build : fix and ignore MSVC warnings (#1889)                                      Borislav Stanimirov
2023-06-05  ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)                             Kawrakow
2023-04-25  ggml : add Q8_0 quantization format (rename the old one to Q8_1) (ARM NEON) (...  Georgi Gerganov
2023-04-22  Fix CI: ARM NEON, quantization unit tests, editorconfig (#1122)                   Stephan Walter
2023-04-22  ggml : unit test for quantization functions (#953)                                unbounded