summaryrefslogtreecommitdiff
path: root/ggml/src/ggml-alloc.c
diff options
context:
space:
mode:
authorIwan Kawrakow <iwan.kawrakow@gmail.com>2024-07-30 12:33:48 +0300
committerKawrakow <48489457+ikawrakow@users.noreply.github.com>2024-08-01 09:38:06 +0200
commitab4f9e1fdb7441f8250364b248b5e709ec66771f (patch)
tree003a54ba333fb25ad5cf764610c72f1b6c0a8946 /ggml/src/ggml-alloc.c
parent69842c6ad805c7de8f0416e52a1f12d3357023d9 (diff)
iq2_k: CUDA dot product finally works
Performance is pathetic: 140 t/s for LLaMA-3.1-8B vs 172 t/s for iq2_xs.
Diffstat (limited to 'ggml/src/ggml-alloc.c')
0 files changed, 0 insertions, 0 deletions