summaryrefslogtreecommitdiff
path: root/iqk-quantize.cpp
diff options
context:
space:
mode:
authorIwan Kawrakow <iwan.kawrakow@gmail.com>2024-07-16 09:12:15 +0200
committerIwan Kawrakow <iwan.kawrakow@gmail.com>2024-07-16 09:12:15 +0200
commit52a25e307c3af8686436d977c60e9975b0900e2b (patch)
tree17240797a176f63bf759d0a68410dc53a6c5913d /iqk-quantize.cpp
parent6393e2682720893092f77c2a6d428a2c13ecccf7 (diff)
iq1bn(no lookup): Metal
In summary, compared to lookup, the multiplication based approach is * Much better on AVX2 * Slightly better on CUDA * Slightly worse on Metal * Much worse on NEON
Diffstat (limited to 'iqk-quantize.cpp')
0 files changed, 0 insertions, 0 deletions