| author | Iwan Kawrakow <iwan.kawrakow@gmail.com> | 2024-07-30 12:33:48 +0300 |
|---|---|---|
| committer | Kawrakow <48489457+ikawrakow@users.noreply.github.com> | 2024-08-01 09:38:06 +0200 |
| commit | ab4f9e1fdb7441f8250364b248b5e709ec66771f (patch) | |
| tree | 003a54ba333fb25ad5cf764610c72f1b6c0a8946 /examples/gguf-hash/deps | |
| parent | 69842c6ad805c7de8f0416e52a1f12d3357023d9 (diff) | |

iq2_k: CUDA dot product finally works

Performance is pathetic: 140 t/s for LLaMA-3.1-8B vs
172 t/s for iq2_xs.

Diffstat (limited to 'examples/gguf-hash/deps')
0 files changed, 0 insertions, 0 deletions
