diff options
author | Iwan Kawrakow <iwan.kawrakow@gmail.com> | 2024-06-17 14:16:24 +0200 |
---|---|---|
committer | Iwan Kawrakow <iwan.kawrakow@gmail.com> | 2024-06-22 12:02:51 +0300 |
commit | 30a771bd6bdaa78ca79e27b38783cedd000c7840 (patch) | |
tree | 64f76b3f70df06434aa137466222b349c2714fa5 /examples/llm.vim | |
parent | 8222c9f3d1e91096ab554f62ffbc384535b1963e (diff) |
iq1_bn: better NEON implementation
PP is decent with 131 t/s (q4_0 has 150 t/s).
TG is better than last commit but still bad at 33.1 t/s
(in comparison q4_0 gets 52.3 t/s).
I had to go to the (0, 1, 2) table. Apple Silicon clearly
does not like operations with signs.
Diffstat (limited to 'examples/llm.vim')
0 files changed, 0 insertions, 0 deletions