path: root/ggml-impl.h
author Kawrakow <48489457+ikawrakow@users.noreply.github.com> 2024-07-26 12:57:23 +0200
committer GitHub <noreply@github.com> 2024-07-26 12:57:23 +0200
commit 0684c3e9c70d49323b4fc517128cbe222cab7f96
tree a193b03f1f02a4e0eba858e29b8c15de45604153 /ggml-impl.h
parent 94b5916319cf1f00c0215dfcee9b531896476c5f
Offload Bitnet token embeddings to the GPU - the right way (#2)
OK, I should have checked how it was done for Gemma and done the same for Bitnet. But better late than never.

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml-impl.h')
0 files changed, 0 insertions, 0 deletions