diff options
author | Kawrakow <48489457+ikawrakow@users.noreply.github.com> | 2024-09-10 09:43:05 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-09-10 09:43:05 +0300 |
commit | a1f7a03f500451be80ec4aeae44665c58cde311f (patch) | |
tree | 0373ddcc1eaf00fa09d368fe5b83c739c5257b06 /ggml/src/ggml-cann.cpp | |
parent | 918ada20faf7747bbda6b78503b5d72a90157844 (diff) |
IQ1_TN Metal implementation (#46)
* iq1_tn: Metal implementation
Rquires to change the get_rows and matrix multiplication kernels
to use a dequantizer type rather than a dequantization function.
But once this is done, we can simply reuse the iq1_bn implementation.
This change will also allow to add other quantization types that
have meta data (such as a row scale) stored at the beginning of
a row (or change existing quantization types to row-wise scales).
* Some cleanup
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-cann.cpp')
0 files changed, 0 insertions, 0 deletions