summaryrefslogtreecommitdiff
path: root/examples/server/tests/features/issues.feature
diff options
context:
space:
mode:
authorPierrick Hymbert <pierrick.hymbert@gmail.com>2024-02-24 19:16:04 +0100
committerGitHub <noreply@github.com>2024-02-24 19:16:04 +0100
commit9e359a4f47c1b2dceb99e29706c9f7403d32ab5e (patch)
treeaa491d0744940ccce9ff69fe1bcc9e1f16b7a1ff /examples/server/tests/features/issues.feature
parent4c4cb30736582cacb1a164a9d4bc8e17b1014be7 (diff)
server: continue to update other slots on embedding concurrent request (#5699)
* server: #5655 - continue to update other slots on embedding concurrent request. * server: tests: add multi users embeddings as fixed * server: tests: adding OAI compatible embedding concurrent endpoint * server: tests: adding OAI compatible embedding with multiple inputs
Diffstat (limited to 'examples/server/tests/features/issues.feature')
-rw-r--r--examples/server/tests/features/issues.feature34
1 files changed, 1 insertions, 33 deletions
diff --git a/examples/server/tests/features/issues.feature b/examples/server/tests/features/issues.feature
index 542006d9..bf5a175a 100644
--- a/examples/server/tests/features/issues.feature
+++ b/examples/server/tests/features/issues.feature
@@ -1,36 +1,4 @@
# List of ongoing issues
@bug
Feature: Issues
- # Issue #5655
- Scenario: Multi users embeddings
- Given a server listening on localhost:8080
- And a model file stories260K.gguf
- And a model alias tinyllama-2
- And 42 as server seed
- And 64 KV cache size
- And 2 slots
- And continuous batching
- And embeddings extraction
- Then the server is starting
- Then the server is healthy
-
- Given a prompt:
- """
- Write a very long story about AI.
- """
- And a prompt:
- """
- Write another very long music lyrics.
- """
- And a prompt:
- """
- Write a very long poem.
- """
- And a prompt:
- """
- Write a very long joke.
- """
- Given concurrent embedding requests
- Then the server is busy
- Then the server is idle
- Then all embeddings are generated
+ # No confirmed issue at the moment