summaryrefslogtreecommitdiff
path: root/iqk-quantize.cpp
diff options
context:
space:
mode:
authorIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-18 18:42:26 +0300
committerIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-22 12:02:52 +0300
commit181fd9c56eaa64d0a92f9e8be7387f409cfa8745 (patch)
tree4df08e0200a38763b53d3635e86ca7980a99f3ae /iqk-quantize.cpp
parentfece7e1db7bf73497a32751af06c6dbf48c26b19 (diff)
Bitnet(1.75 bpw): slightly faster CUDA dot product
We get 205 t/s, so ~13% slower than 2 bit.
Diffstat (limited to 'iqk-quantize.cpp')
0 files changed, 0 insertions, 0 deletions