author | Kawrakow <iwankawrakow@gmail.com> | 2025-06-24 14:21:37 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-06-24 14:21:37 +0200 |
commit | b5f2f0010624f9d2fc64f084113f7d38eb851a52 | |
tree | 002963c826287551e59f1835af75954ba62041d2 /ggml/src/ggml-cann.cpp | |
parent | 64f6c2dead0768049837ac6562c0c176fabc055e | |
Much faster prompt processing for IQ1_S and IQ1_M on ARM_NEON (#553)
* iq1_s
66.3 t/s -> 168.8 t/s.
* iq1_m
19 t/s -> 163 t/s.
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-cann.cpp')
0 files changed, 0 insertions, 0 deletions