summaryrefslogtreecommitdiff
path: root/iqk-quantize.cpp
diff options
context:
space:
mode:
authorIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-17 19:07:38 +0300
committerIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-22 12:02:52 +0300
commit661698513587fea89191a08b9c28a1f5619ebac8 (patch)
tree209dc4d7a09d2523c0e5dd7b832e4f2bc51c3448 /iqk-quantize.cpp
parentf6863cfa1bbc5ac42b78837b355e45d82246a472 (diff)
bitnet 2 bpw: AVX2 implementation
We get PP-512 = 322 t/s. TG is already 51.6 t/s at 4 threads, then it saturates and starts going down for more than 8 threads.
Diffstat (limited to 'iqk-quantize.cpp')
0 files changed, 0 insertions, 0 deletions