Age | Commit message (Expand) | Author |
---|---|---|
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow |
2024-06-13 | `build`: rename main → llama-cli, server → llama-server, llava-cli → ll... | Olivier Chafik |
2024-05-16 | Revert "server bench: fix bench not waiting for model load (#7284)" (#7334) | Pierrick Hymbert |
2024-05-15 | server bench: fix bench not waiting for model load (#7284) | Johannes Gäßler |
2024-04-30 | ggml : add Flash Attention (#5021) | Georgi Gerganov |
2024-04-06 | ci: bench: support sse and fix prompt processing time / server: add tokens us... | Pierrick Hymbert |
2024-04-04 | ci: bench: add more ftype, fix triggers and bot comment (#6466) | Pierrick Hymbert |
2024-03-27 | server: continuous performance monitoring and PR comment (#6283) | Pierrick Hymbert |