diff options
author | slaren <slarengh@gmail.com> | 2023-08-22 15:25:19 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-08-22 15:25:19 +0200 |
commit | 1123f7fbdfb8012e46f05e903e6f675922916378 (patch) | |
tree | 27f3700a672e8f0d09d86797ce1c199ff72a4d51 /examples/embedding/embedding.cpp | |
parent | ef3f333d3775600d1646a9fa249aca532d15fb89 (diff) |
ggml-cuda : use graph allocator (#2684)
use a different function for no_alloc to avoid breaking backwards compat, fixes lora
remove 512 n_batch limit
fixed 2048 batch size
cleanup
Co-authored-by: Johannes Gäßler <johannesg@5d6.de>
Diffstat (limited to 'examples/embedding/embedding.cpp')
0 files changed, 0 insertions, 0 deletions