index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
ggml-kompute.cpp
Age
Commit message (
Expand
)
Author
2024-06-13
move BLAS to a separate backend (#6210)
slaren
2024-06-12
tests : add non-cont unary tests (#7857)
Georgi Gerganov
2024-06-05
ggml : refactor rope norm/neox (#7634)
Georgi Gerganov
2024-06-03
kompute : implement op_getrows_f32 (#6403)
woachk
2024-05-29
metal : remove invalid asserts (#7617)
Georgi Gerganov
2024-05-29
metal : add missing asserts (#7617)
Georgi Gerganov
2024-05-29
ggml : fix YARN + add tests + add asserts (#7617)
Georgi Gerganov
2024-05-21
llama : add phi3 128K model support (#7225)
liuwei-git
2024-05-11
ggml : full ALiBi support (#7192)
Georgi Gerganov
2024-04-30
ggml : add Flash Attention (#5021)
Georgi Gerganov
2024-03-26
llama : greatly reduce output buffer memory usage (#6122)
compilade
2024-03-18
backend : offload large batches to GPU (#6083)
slaren
2024-03-13
llama : add pipeline parallelism support (#6017)
slaren
2024-03-04
ggml : introduce ggml_status (ggml/750)
Michael Podvitskiy
2024-02-28
Introduce backend GUIDs (ggml/743)
UEXTM.com
2024-01-29
Nomic Vulkan backend (#4456)
Jared Van Bortel