path: root/ggml-kompute.cpp
Age         Commit message                                             Author
2024-05-29  metal : remove invalid asserts (#7617)                     Georgi Gerganov
2024-05-29  metal : add missing asserts (#7617)                        Georgi Gerganov
2024-05-29  ggml : fix YARN + add tests + add asserts (#7617)          Georgi Gerganov
2024-05-21  llama : add phi3 128K model support (#7225)                liuwei-git
2024-05-11  ggml : full ALiBi support (#7192)                          Georgi Gerganov
2024-04-30  ggml : add Flash Attention (#5021)                         Georgi Gerganov
2024-03-26  llama : greatly reduce output buffer memory usage (#6122)  compilade
2024-03-18  backend : offload large batches to GPU (#6083)             slaren
2024-03-13  llama : add pipeline parallelism support (#6017)           slaren
2024-03-04  ggml : introduce ggml_status (ggml/750)                    Michael Podvitskiy
2024-02-28  Introduce backend GUIDs (ggml/743)                         UEXTM.com
2024-01-29  Nomic Vulkan backend (#4456)                               Jared Van Bortel