summaryrefslogtreecommitdiff
path: root/ggml-cuda/fattn-vec-f16.cuh
AgeCommit message (Expand)Author
2024-05-12CUDA: add FP32 FlashAttention vector kernel (#7188)Johannes Gäßler