Age | Commit message | Author |
---|---|---|
2024-02-18 | ggml, common, examples, tests : fixed type arguments in printf (#5528) | Herman Semenov |
2024-02-16 | ggml : add numa options (#5377) | bmwl |
2024-01-31 | llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240) | Georgi Gerganov |
2024-01-12 | llama : ggml-backend integration (#4766) | slaren |
2023-12-01 | ggml : add ggml_soft_max_ext (#4256) | Georgi Gerganov |
2023-10-29 | Extend llama_kv_cache_seq_rm to allow matching any sequence (#3843) | Kerfuffle |
2023-10-25 | batched-bench : print params at start | Georgi Gerganov |
2023-10-18 | speculative : add tree-based sampling example (#3624) | Georgi Gerganov |
2023-10-11 | batched : add bench tool (#3545) | Georgi Gerganov |