summaryrefslogtreecommitdiff
path: root/iqk-quantize.cpp
AgeCommit message (Expand)Author
2024-07-27Merge mainline llama.cpp (#3)Kawrakow
2024-07-24Add copyright noticesIwan Kawrakow
2024-07-24Remove unused fileIwan Kawrakow
2024-07-17Fix Makefile, add GGML_USE_IQK_MULMAT ifdefs to iqk-quantizeIwan Kawrakow
2024-07-17iq1bn: faster scalar dot productIwan Kawrakow
2024-07-17iq1bn: fix scalar dot productIwan Kawrakow
2024-07-17iq1bn: adjust scalar dot product and some cleanupIwan Kawrakow
2024-07-17iq1bn(no lookup): better versionIwan Kawrakow
2024-07-15iq1bn: attempt without a lookup tableIwan Kawrakow
2024-06-25bitnet: remove iq1_bn lookup table storing +/- signsIwan Kawrakow
2024-06-25bitnet: simdify q8_K64 quantization on AVXIwan Kawrakow
2024-06-25bitnet: NEON improvements for iq1_bnIwan Kawrakow
2024-06-25Bitnet: trying an alternative iq1_bn gridIwan Kawrakow
2024-06-25bitnet: fix scalar dot product for 1.625 bpwIwan Kawrakow
2024-06-22bitnet: qnfs testsIwan Kawrakow
2024-06-22bitnet(scale in a separate tensor): more CPU improvementsIwan Kawrakow
2024-06-22bitnet(scale in a separate tensor): CPU improvementsIwan Kawrakow
2024-06-22bitnet: put the scale in a separate tensorIwan Kawrakow
2024-06-22Bitnet(1.75 bpw): higher precision fp8 scaleIwan Kawrakow
2024-06-22Bitnet: 2.25 bpw versionIwan Kawrakow
2024-06-22bitnet: add 2 bpw quantizationIwan Kawrakow
2024-06-22Move Q8_K64 quantization to iqk-quantize.cpp and add copyright noticeIwan Kawrakow
2024-06-22bitnet: fix scalar dot productIwan Kawrakow
2024-06-22bitnet: python + llamaIwan Kawrakow