summaryrefslogtreecommitdiff
path: root/ggml-cuda/common.cuh
AgeCommit message (Expand)Author
2024-05-01CUDA: CUDART < 11.7 workaround for __hmax, __hmax2 (#7019)Johannes Gäßler
2024-04-30ggml : add Flash Attention (#5021)Georgi Gerganov
2024-04-09llama : add Command R Plus support (#6491)Carolinabanana
2024-03-29sync : ggml (#6351)Georgi Gerganov
2024-03-25cuda : refactor into multiple files (#6269)slaren