diff options
author | Kawrakow <iwankawrakow@gmail.com> | 2025-02-25 17:55:58 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-02-25 17:55:58 +0200 |
commit | 94b659a2f106e017e5eeb6f492dc9f290e136833 (patch) | |
tree | 8b744ff19b476f7d08e9691def83ad3fbb27c763 /examples/parallel | |
parent | 547eee81d99a2676975a9768166b7d164473b8fa (diff) |
Give the user the option to override where model weights are stored (#232)
* Give the user the option to override where model weights are stored
* Fix ggml_nbytes() problem and cleanup
For a tensor with zero elements ggml_nbytes() was returning
uint64_t::max, and this was causing graph allocation failure.
* Add timing info to CUDA graph evaluation
* Add more timing info
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples/parallel')
0 files changed, 0 insertions, 0 deletions