summaryrefslogtreecommitdiff
path: root/examples/server/index.js.hpp
diff options
context:
space:
mode:
authorKawrakow <48489457+ikawrakow@users.noreply.github.com>2024-02-05 10:46:06 +0200
committerGitHub <noreply@github.com>2024-02-05 10:46:06 +0200
commit6fdfa2ecc684000a25a4ad91823bc82a6652b645 (patch)
treec98969391003efff3b83b4ede0a50759b80fa3ab /examples/server/index.js.hpp
parenta2d60c9158435ae9a6f14632f07f1acf7a3becef (diff)
iq2_xxs: tune quantization (#5320)
We get slightly better PPL, and we cut quantization time in nearly half. The trick is to 1st quantize without forcing points onto the E8-lattice. We can then use a narrower search range around the block scale that we got that way. Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples/server/index.js.hpp')
0 files changed, 0 insertions, 0 deletions