summaryrefslogtreecommitdiff
path: root/ggml/src/ggml-cuda/concat.cu
AgeCommit message (Expand)Author
2025-04-07Add copyright notices (#317)Kawrakow
2025-03-18Make Q8_0 KV cache work with mla=2,fa on CUDA (#264)Kawrakow
2025-03-18FlashMLA-2: reduce compute buffer size (CUDA and CPU) (#260)Kawrakow
2025-03-01Reduce size of compute buffers (#237)Kawrakow
2024-07-27Merge mainline llama.cpp (#3)Kawrakow