path: root/examples/server/tests/features/steps/steps.py
Age | Commit message | Author
2024-03-27 | server: continuous performance monitoring and PR comment (#6283) | Pierrick Hymbert
2024-03-23 | common: llama_load_model_from_url split support (#6192) | Pierrick Hymbert
2024-03-21 | json-schema-to-grammar improvements (+ added to server) (#5978) | Olivier Chafik
2024-03-20 | server : allow to override -ngl in tests (#6170) | Georgi Gerganov
2024-03-20 | server tests : more pythonic process management; fix bare `except:` (#6146) | Jared Van Bortel
2024-03-17 | common: llama_load_model_from_url using --model-url (#6098) | Pierrick Hymbert
2024-03-14 | server: disable debug release type sanitizer, simplify trigger (#6047) | Pierrick Hymbert
2024-03-13 | llama : add pipeline parallelism support (#6017) | slaren
2024-03-11 | Server: format error to json (#5961) | Xuan Son Nguyen
2024-03-10 | server: ci: windows build and tests (#5968) | Pierrick Hymbert
2024-03-09 | Server: reorganize some http logic (#5939) | Xuan Son Nguyen
2024-03-09 | server: tests: add truncated prompt tests, better kv cache size (#5933) | Pierrick Hymbert
2024-03-08 | server: metrics: add llamacpp:prompt_seconds_total and llamacpp:tokens_predic... | Pierrick Hymbert
2024-03-07 | server : refactor (#5882) | Georgi Gerganov
2024-03-02 | server: tests: passkey challenge / self-extend with context shift demo (#5832) | Pierrick Hymbert
2024-02-28 | server : add "/chat/completions" alias for "/v1/...` (#5722) | Jorge A
2024-02-25 | server: tests - slow inference causes timeout on the CI (#5715) | Pierrick Hymbert
2024-02-25 | server: logs - unified format and --log-format option (#5700) | Pierrick Hymbert
2024-02-25 | server: concurrency fix + monitoring - add /metrics prometheus compatible end... | Pierrick Hymbert
2024-02-24 | server: continue to update other slots on embedding concurrent request (#5699) | Pierrick Hymbert
2024-02-24 | server: init functional tests (#5566) | Pierrick Hymbert