index : ik_llama.cpp.git (main)
path: root/tests/test-quantize-fns.cpp
Age | Commit message | Author
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow
2024-06-22 | bitnet: qnfs tests | Iwan Kawrakow
2024-03-25 | tests : include IQ2_XXS and IQ2_XS in test-quantize-fns (#6303) | Kawrakow
2024-02-26 | Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range... | Kawrakow
2024-02-24 | IQ3_S: a much better alternative to Q3_K (#5676) | Kawrakow
2024-02-11 | ggml : add mmla kernels for quantized GEMM (#4966) | snadampal
2024-01-30 | SOTA 3-bit quants (#5196) | Kawrakow
2024-01-11 | ggml : SOTA 2-bit quants (add IQ2_XS) (#4856) | Kawrakow
2024-01-08 | SOTA 2-bit quants (#4773) | Kawrakow
2023-10-30 | ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861) | Georgi Gerganov
2023-09-15 | check C++ code with -Wmissing-declarations (#3184) | Cebtenzzre
2023-07-05 | ggml : generalize `quantize_fns` for simpler FP16 handling (#1237) | Stephan Walter
2023-06-16 | build : fix and ignore MSVC warnings (#1889) | Borislav Stanimirov
2023-06-05 | ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684) | Kawrakow
2023-04-25 | ggml : add Q8_0 quantization format (rename the old one to Q8_1) (ARM NEON) (... | Georgi Gerganov
2023-04-22 | Fix CI: ARM NEON, quantization unit tests, editorconfig (#1122) | Stephan Walter
2023-04-22 | ggml : unit test for quantization functions (#953) | unbounded