author | Kawrakow <iwankawrakow@gmail.com> | 2024-10-14 10:46:41 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-10-14 10:46:41 +0300 |
commit | 302a6225a1f113eb384999aebd7f3ed6c0baed06 (patch) | |
tree | b1d8bc73f6c7d9ddfe9474371c2da5fcdc620b47 /examples | |
parent | baab1d9a1e5d28bddb91dd962223be558bf7737d (diff) |
iq3_k: fix and optimize Metal dot product (#87)
* iq3_k: fix Metal dot product
I was accessing the scales as if they were 4-byte aligned, but iq3_k
is not 4-byte aligned. Instead of raising an error (as happens
on CUDA when one makes this mistake), Metal silently accepts
the access and we get garbage.
* iq3_k: slightly faster Metal dot product
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
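
The alignment issue described in the commit message can be illustrated with a small host-side C++ sketch. This is not the Metal kernel from this commit and `DemoBlock` is not the real iq3_k block layout; the struct, `load_u32_le`, and `scales_word` are hypothetical names chosen only to show one way of reading a 32-bit word from data that is guaranteed just 2-byte alignment, by assembling it from narrower loads rather than casting the pointer to a 32-bit type.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical layout for illustration only; this is NOT the real iq3_k block.
struct DemoBlock {
    uint16_t d;          // 2-byte field: the struct is only 2-byte aligned
    uint8_t  scales[8];  // begins at byte offset 2, not a multiple of 4
};

static_assert(alignof(DemoBlock) == 2, "demo block is 2-byte aligned");
static_assert(offsetof(DemoBlock, scales) % 4 != 0,
              "scales do not start on a 4-byte boundary");

// Assemble a 32-bit little-endian word from single-byte loads.
// No alignment beyond 1 byte is assumed for p.
inline uint32_t load_u32_le(const uint8_t* p) {
    return  static_cast<uint32_t>(p[0])
         | (static_cast<uint32_t>(p[1]) << 8)
         | (static_cast<uint32_t>(p[2]) << 16)
         | (static_cast<uint32_t>(p[3]) << 24);
}

// Read the i-th 32-bit word of the scales (i is 0 or 1) without
// reinterpreting the byte pointer as a uint32_t pointer.
inline uint32_t scales_word(const DemoBlock& b, int i) {
    assert(i >= 0 && i < 2);
    return load_u32_le(b.scales + 4 * i);
}
```

Casting `b.scales` to `const uint32_t *` and dereferencing it would be the host-side analogue of the mistake described above: the behavior is not defined for a misaligned address, so a platform is free to fault or to return wrong data without any diagnostic.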
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions