summaryrefslogtreecommitdiff
path: root/ggml-cuda/fattn-common.cuh
AgeCommit message (Expand)Author
2024-05-18CUDA: deduplicate FlashAttention code (#7352)Johannes Gäßler
2024-05-12CUDA: add FP32 FlashAttention vector kernel (#7188)Johannes Gäßler