path: root/examples/llama-bench/llama-bench.cpp
author: Olivier Chafik <ochafik@users.noreply.github.com> 2024-04-30 00:52:50 +0100
committer: GitHub <noreply@github.com> 2024-04-30 00:52:50 +0100
commit: 8843a98c2ba97a25e93319a104f9ddfaf83ce4c4 (patch)
tree: 82d73687b9dd42033a388d83c3b491925a0444b9 /examples/llama-bench/llama-bench.cpp
parent: b8c1476e44cc1f3a1811613f65251cf779067636 (diff)
Improve usability of --model-url & related flags (#6930)

* args: default --model to models/ + filename from --model-url or --hf-file (or else legacy models/7B/ggml-model-f16.gguf)
* args: main & server now call gpt_params_handle_model_default
* args: define DEFAULT_MODEL_PATH + update cli docs
* curl: check url of previous download (.json metadata w/ url, etag & lastModified)
* args: fix update to quantize-stats.cpp
* curl: support legacy .etag / .lastModified companion files
* curl: rm legacy .etag file support
* curl: reuse regex across headers callback calls
* curl: unique_ptr to manage lifecycle of curl & outfile
* curl: nit: no need for multiline regex flag
* curl: update failed test (model file collision) + gitignore *.gguf.json
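
The defaulting rule in the first item can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from this commit: `resolve_model_path`, `filename_from_url`, and `kDefaultModelPath` are hypothetical names, the precedence of --hf-file over --model-url is assumed, and so is the choice to keep only the last path segment and drop any query string or fragment.

```cpp
#include <string>

// Hypothetical stand-in for DEFAULT_MODEL_PATH; the value is the legacy
// path quoted in the commit message.
static const std::string kDefaultModelPath = "models/7B/ggml-model-f16.gguf";

// Return the last path segment of a URL or file path, ignoring any
// "?query" or "#fragment" suffix (an assumption for this sketch).
static std::string filename_from_url(const std::string & url) {
    const std::string s = url.substr(0, url.find_first_of("?#"));
    const std::string::size_type slash = s.find_last_of('/');
    return slash == std::string::npos ? s : s.substr(slash + 1);
}

// Pick the model path: an explicit --model wins; otherwise derive
// "models/<filename>" from --hf-file or --model-url; otherwise fall back
// to the legacy default. The hf_file-before-model_url order is assumed.
static std::string resolve_model_path(const std::string & model,
                                      const std::string & model_url,
                                      const std::string & hf_file) {
    if (!model.empty()) {
        return model;
    }
    if (!hf_file.empty()) {
        return "models/" + filename_from_url(hf_file);
    }
    if (!model_url.empty()) {
        return "models/" + filename_from_url(model_url);
    }
    return kDefaultModelPath;
}
```

With this sketch, passing only a model URL ending in `/model-q4.gguf?download=true` would resolve to `models/model-q4.gguf`, and passing nothing at all would resolve to the legacy default path.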
Diffstat (limited to 'examples/llama-bench/llama-bench.cpp')
0 files changed, 0 insertions, 0 deletions