author | Georgi Gerganov <ggerganov@gmail.com> | 2023-08-27 16:40:48 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-08-27 16:40:48 +0300 |
commit | eaa13a48ff4136f01c1cdb79cacd61b67ec53095 (patch) | |
tree | 1e22d465164eb73b72dd6dab345987ea5691e6f2 /examples | |
parent | da7455d0467b5f5cc2e45d0dcffaf098df13db63 (diff) |
falcon : fix CUDA inference by making K and Q contiguous (#2830)
* falcon : fix CUDA inference by making K and Q contiguous
ggml-ci
* cuda : add assert to guard from non-cont ropes
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions