summaryrefslogtreecommitdiff
path: root/examples
diff options
context:
space:
mode:
authorKawrakow <iwankawrakow@gmail.com>2024-10-14 10:46:41 +0300
committerGitHub <noreply@github.com>2024-10-14 10:46:41 +0300
commit302a6225a1f113eb384999aebd7f3ed6c0baed06 (patch)
treeb1d8bc73f6c7d9ddfe9474371c2da5fcdc620b47 /examples
parentbaab1d9a1e5d28bddb91dd962223be558bf7737d (diff)
iq3_k: fix and optimize Metal dot product (#87)
* iq3_k: fix Metal dot product I was accessing the scales as 4-byte aligned, but iq3_k is not 4-byte aligned. Instead of throwing an error (as it happens on CUDA when one makes this mistake), Metal silently accepts and we get garbage. * iq3_k: slightly faster Metal dot product --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions