server: continue to update other slots on embedding concurrent request (#5699)

* server: #5655 - continue to update other slots on embedding concurrent request. * server: tests: add multi users embeddings as fixed * server: tests: adding OAI compatible embedding concurrent endpoint * server: tests: adding OAI compatible embedding with multiple inputs
author: Pierrick Hymbert <pierrick.hymbert@gmail.com> 2024-02-24 19:16:04 +0100
committer: GitHub <noreply@github.com> 2024-02-24 19:16:04 +0100
commit: 9e359a4f47c1b2dceb99e29706c9f7403d32ab5e (patch)
tree: aa491d0744940ccce9ff69fe1bcc9e1f16b7a1ff /examples/server/tests/features/issues.feature
parent: 4c4cb30736582cacb1a164a9d4bc8e17b1014be7 (diff)
1 files changed, 1 insertions, 33 deletions
diff --git a/examples/server/tests/features/issues.feature b/examples/server/tests/features/issues.feature
index 542006d9..bf5a175a 100644
--- a/examples/server/tests/features/issues.feature
+++ b/examples/server/tests/features/issues.feature
@@ -1,36 +1,4 @@
 # List of ongoing issues
 @bug
 Feature: Issues
-    # Issue #5655
-  Scenario: Multi users embeddings
-    Given a server listening on localhost:8080
-    And   a model file stories260K.gguf
-    And   a model alias tinyllama-2
-    And   42 as server seed
-    And   64 KV cache size
-    And   2 slots
-    And   continuous batching
-    And   embeddings extraction
-    Then  the server is starting
-    Then  the server is healthy
-
-    Given a prompt:
-      """
-      Write a very long story about AI.
-      """
-    And a prompt:
-      """
-      Write another very long music lyrics.
-      """
-    And a prompt:
-      """
-      Write a very long poem.
-      """
-    And a prompt:
-      """
-      Write a very long joke.
-      """
-    Given concurrent embedding requests
-    Then the server is busy
-    Then the server is idle
-    Then all embeddings are generated
+  # No confirmed issue at the moment
author	Pierrick Hymbert <pierrick.hymbert@gmail.com>	2024-02-24 19:16:04 +0100
committer	GitHub <noreply@github.com>	2024-02-24 19:16:04 +0100
commit	9e359a4f47c1b2dceb99e29706c9f7403d32ab5e (patch)
tree	aa491d0744940ccce9ff69fe1bcc9e1f16b7a1ff /examples/server/tests/features/issues.feature
parent	4c4cb30736582cacb1a164a9d4bc8e17b1014be7 (diff)