diff options
author | Kawrakow <iwankawrakow@gmail.com> | 2025-01-30 18:36:24 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-01-30 18:36:24 +0200 |
commit | ecf111a11ca56ff0731308f94bd6c5e96658b6ef (patch) | |
tree | f05decc6721785febc889b246955571c32b28b4f /ggml/src/ggml-quants.h | |
parent | 2e6b523853a8659c63283a6deca805051ecd713a (diff) |
Deepseek-Lite (#184)
* Quantization mixes tweaks
* Make iq4_nl_r4 work with row size that are not a multiple of 128
... on Zen4
* Make iq4_nl_r4 work with row size that are not a multiple of 128
... on AVX2
* Make iq4_nl_r4 work with row size that are not a multiple of 128
... on AVX2
* Make q6_0_w4 work with row size that are not a multiple of 128
... on Zen4
* Make q6_0_w4 work with row size that are not a multiple of 128
... on Zen4
* Make q5_0_r4 work with row size that are not a multiple of 128
... on Zen4 and AVX2
* Make q5,6_0_r4, iq4_nl_e4 work with row size that are not a multiple of 128
also on NEON.
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-quants.h')
0 files changed, 0 insertions, 0 deletions