path: root/examples/json_schema_to_grammar.py
author jiez <373447296@qq.com> 2024-04-25 18:29:35 +0800
committer GitHub <noreply@github.com> 2024-04-25 13:29:35 +0300
commit 1966eb2615242f224bf9ca939db8905ab6a174a0
tree 3da33a1b5f816723e195a4936d44c4bef2eaa06a /examples/json_schema_to_grammar.py
parent 784e11dea1f5ce9638851b2b0dddb107e2a609c8
quantize : add '--keep-split' to quantize model into shards (#6688)
* Implement '--keep-split' to quantize model into several shards

* Add test script

* Update examples/quantize/quantize.cpp

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* Split model correctly even if tensor id is out-of-order

* Update llama_model_quantize_params

* Fix preci failures

---------

Co-authored-by: z5269887 <z5269887@unsw.edu.au>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'examples/json_schema_to_grammar.py')
0 files changed, 0 insertions, 0 deletions