author Iwan Kawrakow <iwan.kawrakow@gmail.com> 2024-06-08 09:02:23 +0300
committer Iwan Kawrakow <iwan.kawrakow@gmail.com> 2024-06-22 12:02:50 +0300
commit cd3d8ae0e719b47fb0ef63b0f7b9e1dacbab7de1
tree 5ea228ac8f54d743889a2df699af65b396d9048e /ggml.c
parent 299c7f6e89d2d8c4162be06463b82a07540d5691
iqk_mul_mat: use block_q8_1_x4 also for AVX2
Here the performance gain is more significant. E.g., for q4_1, PP-512 rises from 137 t/s to 168 t/s. The gap to q4_0 is now large enough that I wonder whether I should switch the qX_0 legacy quants to Q8_1 as well.
Diffstat (limited to 'ggml.c')
0 files changed, 0 insertions, 0 deletions