diff options
author | Pierrick Hymbert <pierrick.hymbert@gmail.com> | 2024-02-24 19:16:04 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-02-24 19:16:04 +0100 |
commit | 9e359a4f47c1b2dceb99e29706c9f7403d32ab5e (patch) | |
tree | aa491d0744940ccce9ff69fe1bcc9e1f16b7a1ff /examples/server/tests/features/issues.feature | |
parent | 4c4cb30736582cacb1a164a9d4bc8e17b1014be7 (diff) |
server: continue to update other slots on embedding concurrent request (#5699)
* server: #5655 - continue to update other slots on embedding concurrent request.
* server: tests: add multi users embeddings as fixed
* server: tests: adding OAI compatible embedding concurrent endpoint
* server: tests: adding OAI compatible embedding with multiple inputs
Diffstat (limited to 'examples/server/tests/features/issues.feature')
-rw-r--r-- | examples/server/tests/features/issues.feature | 34 |
1 files changed, 1 insertions, 33 deletions
diff --git a/examples/server/tests/features/issues.feature b/examples/server/tests/features/issues.feature index 542006d9..bf5a175a 100644 --- a/examples/server/tests/features/issues.feature +++ b/examples/server/tests/features/issues.feature @@ -1,36 +1,4 @@ # List of ongoing issues @bug Feature: Issues - # Issue #5655 - Scenario: Multi users embeddings - Given a server listening on localhost:8080 - And a model file stories260K.gguf - And a model alias tinyllama-2 - And 42 as server seed - And 64 KV cache size - And 2 slots - And continuous batching - And embeddings extraction - Then the server is starting - Then the server is healthy - - Given a prompt: - """ - Write a very long story about AI. - """ - And a prompt: - """ - Write another very long music lyrics. - """ - And a prompt: - """ - Write a very long poem. - """ - And a prompt: - """ - Write a very long joke. - """ - Given concurrent embedding requests - Then the server is busy - Then the server is idle - Then all embeddings are generated + # No confirmed issue at the moment |