Age | Commit message (Expand) | Author |
---|---|---|
2024-02-25 | server: concurrency fix + monitoring - add /metrics prometheus compatible end... | Pierrick Hymbert |
2024-02-21 | server: health: fix race condition on slots data using tasks queue (#5634) | Pierrick Hymbert |
2024-02-20 | Server: use llama_chat_apply_template (#5593) | Xuan Son Nguyen |
2024-02-18 | server : graceful server shutdown (#5244) | Daniel Hiltgen |
2024-02-11 | server : add llama2 chat template (#5425) | Xuan Son Nguyen |
2024-01-27 | sync : ggml | Georgi Gerganov |
2024-01-26 | server : refactored the task processing logic (#5065) | Xuan Son Nguyen |