diff options
author | Iwan Kawrakow <iwan.kawrakow@gmail.com> | 2024-06-18 18:42:26 +0300 |
---|---|---|
committer | Iwan Kawrakow <iwan.kawrakow@gmail.com> | 2024-06-22 12:02:52 +0300 |
commit | 181fd9c56eaa64d0a92f9e8be7387f409cfa8745 (patch) | |
tree | 4df08e0200a38763b53d3635e86ca7980a99f3ae /iqk-quantize.cpp | |
parent | fece7e1db7bf73497a32751af06c6dbf48c26b19 (diff) |
Bitnet(1.75 bpw): slightly faster CUDA dot product
We get 205 t/s, so ~13% slower than 2 bit.
Diffstat (limited to 'iqk-quantize.cpp')
0 files changed, 0 insertions, 0 deletions