Age | Commit message (Expand) | Author |
---|---|---|
2024-03-23 | common: llama_load_model_from_url split support (#6192) | Pierrick Hymbert |
2024-03-09 | server: tests: add truncated prompt tests, better kv cache size (#5933) | Pierrick Hymbert |
2024-03-07 | server : refactor (#5882) | Georgi Gerganov |
2024-03-02 | server: tests: passkey challenge / self-extend with context shift demo (#5832) | Pierrick Hymbert |
2024-02-28 | server : add "/chat/completions" alias for "/v1/...` (#5722) | Jorge A |
2024-02-24 | server: continue to update other slots on embedding concurrent request (#5699) | Pierrick Hymbert |
2024-02-24 | server: init functional tests (#5566) | Pierrick Hymbert |