summaryrefslogtreecommitdiff
path: root/ggml-cuda
AgeCommit message (Expand)Author
2024-04-09llama : add Command R Plus support (#6491)Carolinabanana
2024-04-03ggml : mul_mat_id use the same tensor for all the experts (#6387)slaren
2024-03-29sync : ggml (#6351)Georgi Gerganov
2024-03-26IQ1_M: 1.75 bpw quantization (#6302)Kawrakow
2024-03-25cuda : fix LLAMA_CUDA_F16 build (#6298)slaren
2024-03-25cuda : refactor into multiple files (#6269)slaren