ik_llama.cpp.git
main
path: root/iqk-quantize.cpp
Age  Commit message  Author
2024-07-27  Merge mainline llama.cpp (#3)  Kawrakow
2024-07-24  Add copyright notices  Iwan Kawrakow
2024-07-24  Remove unused file  Iwan Kawrakow
2024-07-17  Fix Makefile, add GGML_USE_IQK_MULMAT ifdefs to iqk-quantize  Iwan Kawrakow
2024-07-17  iq1bn: faster scalar dot product  Iwan Kawrakow
2024-07-17  iq1bn: fix scalar dot product  Iwan Kawrakow
2024-07-17  iq1bn: adjust scalar dot product and some cleanup  Iwan Kawrakow
2024-07-17  iq1bn(no lookup): better version  Iwan Kawrakow
2024-07-15  iq1bn: attempt without a lookup table  Iwan Kawrakow
2024-06-25  bitnet: remove iq1_bn lookup table storing +/- signs  Iwan Kawrakow
2024-06-25  bitnet: simdify q8_K64 quantization on AVX  Iwan Kawrakow
2024-06-25  bitnet: NEON improvements for iq1_bn  Iwan Kawrakow
2024-06-25  Bitnet: trying an alternative iq1_bn grid  Iwan Kawrakow
2024-06-25  bitnet: fix scalar dot product for 1.625 bpw  Iwan Kawrakow
2024-06-22  bitnet: qnfs tests  Iwan Kawrakow
2024-06-22  bitnet(scale in a separate tensor): more CPU improvements  Iwan Kawrakow
2024-06-22  bitnet(scale in a separate tensor): CPU improvements  Iwan Kawrakow
2024-06-22  bitnet: put the scale in a separate tensor  Iwan Kawrakow
2024-06-22  Bitnet(1.75 bpw): higher precision fp8 scale  Iwan Kawrakow
2024-06-22  Bitnet: 2.25 bpw version  Iwan Kawrakow
2024-06-22  bitnet: add 2 bpw quantization  Iwan Kawrakow
2024-06-22  Move Q8_K64 quantization to iqk-quantize.cpp and add copyright notice  Iwan Kawrakow
2024-06-22  bitnet: fix scalar dot product  Iwan Kawrakow
2024-06-22  bitnet: python + llama  Iwan Kawrakow