summaryrefslogtreecommitdiff
path: root/ggml/src/ggml-backend-impl.h
diff options
context:
space:
mode:
authorKawrakow <iwankawrakow@gmail.com>2025-04-01 10:31:06 +0200
committerGitHub <noreply@github.com>2025-04-01 10:31:06 +0200
commit190e7866db1d87a5da8b2d2b8d6619092b2ec72c (patch)
tree56bb72d89bdf4144d0e8a1fc11572f7928661da6 /ggml/src/ggml-backend-impl.h
parentb07a337bfe033f00fdad3ad0e809eecd8f5f2d2c (diff)
Quantization improvements (2) (#302)
* iq3_k: slightly better quantization Not much of a difference for most models, but this change avoids what it looks like a catastrophic failure for DeepSeek-Lite (PPL is now 7.041 vs 7.314 on main). * Small improvement for type-1 quants --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-backend-impl.h')
0 files changed, 0 insertions, 0 deletions