author | Kawrakow <48489457+ikawrakow@users.noreply.github.com> | 2024-01-11 20:43:15 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-11 21:43:15 +0200 |
commit | 469e75d0a35b08de549a4fd87f082ca7a8a539ba (patch) | |
tree | 39969cc5ba3c124a5464f1a2ec177429bf4c516e /examples/server/server.cpp | |
parent | 49662cbed3e95f5976c070b85b9fd53fd577038d (diff) |
llama : restore intended k-quants mixes for MoE models (#4872)
* Restore intended k-quants quantization mixes for MoE models
* Update Q2_K_S values in the quantize tool
Still using LLaMA-v1 PPL values in the quant description
today does not make much sense. But let's leave this update
for another PR.
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'examples/server/server.cpp')
0 files changed, 0 insertions, 0 deletions