path: root/ggml/src/ggml-cann
author Kawrakow <48489457+ikawrakow@users.noreply.github.com> 2024-09-10 09:43:05 +0300
committer GitHub <noreply@github.com> 2024-09-10 09:43:05 +0300
commit a1f7a03f500451be80ec4aeae44665c58cde311f
tree 0373ddcc1eaf00fa09d368fe5b83c739c5257b06 /ggml/src/ggml-cann
parent 918ada20faf7747bbda6b78503b5d72a90157844
IQ1_TN Metal implementation (#46)
* iq1_tn: Metal implementation

Requires changing the get_rows and matrix multiplication kernels to use a dequantizer type rather than a dequantization function. Once this is done, we can simply reuse the iq1_bn implementation.

This change will also allow adding other quantization types that have metadata (such as a row scale) stored at the beginning of a row (or changing existing quantization types to row-wise scales).

* Some cleanup

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
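
The switch from a dequantization function to a dequantizer type can be illustrated with a small host-side sketch in plain C++. This is not the Metal code from this commit: the names `RowScaleDequantizer`, `PlainDequantizer`, `get_row` and `pack_row`, as well as the row layout (one float scale followed by int8 quants), are hypothetical and chosen only to show why a stateful type can consume per-row metadata while a stateless per-block function cannot.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical row layout: [float row_scale][int8 quants ...].
// The dequantizer reads the metadata once per row and keeps it as state.
struct RowScaleDequantizer {
    float scale = 0.0f;
    const int8_t * quants = nullptr;

    void new_row(const uint8_t * row) {
        std::memcpy(&scale, row, sizeof(float));  // metadata at the start of the row
        quants = reinterpret_cast<const int8_t *>(row + sizeof(float));
    }
    float get(size_t i) const { return scale * static_cast<float>(quants[i]); }
};

// Hypothetical layout without row metadata; same interface, no state to load.
struct PlainDequantizer {
    const int8_t * quants = nullptr;

    void new_row(const uint8_t * row) {
        quants = reinterpret_cast<const int8_t *>(row);
    }
    float get(size_t i) const { return static_cast<float>(quants[i]); }
};

// A get_rows-like routine written once against the dequantizer interface.
template <typename Dequantizer>
std::vector<float> get_row(const uint8_t * data, size_t row_bytes, size_t row, size_t n) {
    Dequantizer deq;
    deq.new_row(data + row * row_bytes);
    std::vector<float> out(n);
    for (size_t i = 0; i < n; ++i) {
        out[i] = deq.get(i);
    }
    return out;
}

// Helper that packs one row in the hypothetical row-scale layout.
std::vector<uint8_t> pack_row(float scale, const std::vector<int8_t> & q) {
    std::vector<uint8_t> row(sizeof(float) + q.size());
    std::memcpy(row.data(), &scale, sizeof(float));
    std::memcpy(row.data() + sizeof(float), q.data(), q.size());
    return row;
}
```

Under these assumptions, a call such as `get_row<RowScaleDequantizer>(row.data(), row.size(), 0, n)` handles the row scale inside the dequantizer, so the same templated routine serves layouts with and without row metadata.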
Diffstat (limited to 'ggml/src/ggml-cann')
0 files changed, 0 insertions, 0 deletions