author | jiez <373447296@qq.com> | 2024-04-12 18:45:06 +0800 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-04-12 13:45:06 +0300 |
commit | 91c736015b66ba1d0b82cbae6313b6d5eaa61b68 (patch) | |
tree | 098b60b95e78a1062daf0fe2b362de506eb23df7 /examples/finetune | |
parent | 5c4d767ac028c0f9c31cba3fceaf765c6097abfc (diff) |
llama : add gguf_remove_key + remove split meta during quantize (#6591)
* Remove split metadata when quantizing model shards
* Find metadata key by enum
* Correct loop range for gguf_remove_key and code format
* Free kv memory
---------
Co-authored-by: z5269887 <z5269887@unsw.edu.au>
Diffstat (limited to 'examples/finetune')
0 files changed, 0 insertions, 0 deletions