summaryrefslogtreecommitdiff
path: root/iqk_mul_mat.cpp
AgeCommit message (Expand)Author
2024-06-22iqk_mul_mat: be independent of llamafile_sgemm (WIP)Iwan Kawrakow
2024-06-22iqk_mul_mat: be able to handle any f16/f32 combination on AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: turn on AVX512Iwan Kawrakow
2024-06-22iqk_mul_mat: slightly better fp16 with 16 vector registersIwan Kawrakow
2024-06-22iqk_mul_mat: better fp16 for AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: fp16 for ArmIwan Kawrakow
2024-06-22iqk_mul_mat: slightly faster FANCY_SIMD dot productIwan Kawrakow
2024-06-22iqk_mul_mat: fix q8_0Iwan Kawrakow
2024-06-22iqk_mul_mat: use block_q8_1_x4 also for AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: use block_q8_0_x4 also for AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: delete unused stuffIwan Kawrakow
2024-06-22iqk_mul_mat: add q8_0Iwan Kawrakow
2024-06-22iqk_mul_mat: fp16 tweaksIwan Kawrakow
2024-06-22iqk_mul_mat: fp16 implementation cleanupIwan Kawrakow
2024-06-22iqk_mul_mat: fp16 implementation for AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: make it independent of sgemmIwan Kawrakow
2024-06-22iqk_mul_mat: minor improvementsIwan Kawrakow
2024-06-22iqk_mul_mat: no more templates in the IQ dequantizersIwan Kawrakow
2024-06-22iqk_mul_mat: remove template on one of the prepare() functionsIwan Kawrakow
2024-06-22iqk_mul_mat: experimenting with zen4Iwan Kawrakow
2024-06-22iqk_mul_mat: experimenting with zen4 (iq2_xxs)Iwan Kawrakow
2024-06-22iqk_mul_mat: experimenting with zen4 (iq2_xs)Iwan Kawrakow
2024-06-22iqk_mul_mat: experimenting with zen4 (iq3_s and iq2_m)Iwan Kawrakow
2024-06-22iqk_mul_mat: small improvement for iq3_sIwan Kawrakow
2024-06-22iqk_mul_mat: better AVX2 implementation for iq2_xxsIwan Kawrakow
2024-06-22iqk_mul_mat: better AVX2 implementation for iq2_xxsIwan Kawrakow
2024-06-22iqk_mul_mat: AVX2 implementation for iq2_xxsIwan Kawrakow
2024-06-22iqk_mul_mat: AVX2 implementation for iq2_xsIwan Kawrakow
2024-06-22iqk_mul_mat: AVX2 implementation for iq2_sIwan Kawrakow
2024-06-22Separate templates for TG and PP for i-quants on AVX2Iwan Kawrakow
2024-06-22iqk_mul_mat: AVX2 implementation for iq3_xxsIwan Kawrakow
2024-06-22iqk_mul_mat: AVX2 implementation for iq3_sIwan Kawrakow
2024-06-22Cleanup - Arm i-quants should be good nowIwan Kawrakow
2024-06-22iqk_mul_mat: Arm implementation for iq3_s (llama.cpp version)Iwan Kawrakow
2024-06-22SimplifyIwan Kawrakow
2024-06-22iqk_mul_mat: Arm implementation for iq3_xxs (llama.cpp version)Iwan Kawrakow
2024-06-22iqk_mul_mat: Arm implementation for iq2_xs (llama.cpp version)Iwan Kawrakow
2024-06-22iqk_mul_mat: Arm implementation for iq2_s (llama.cpp version)Iwan Kawrakow
2024-06-22Add Q8_0Iwan Kawrakow
2024-06-22CosmeticsIwan Kawrakow
2024-06-22iqk_mul_mat: Arm implementation for iq2_xxs (llama.cpp version)Iwan Kawrakow
2024-06-22iqk_mul_mat: faster q3_K TGIwan Kawrakow
2024-06-22iqk_mul_mat for llama.cppIwan Kawrakow