author: Pierrick Hymbert <pierrick.hymbert@gmail.com>, 2024-02-18 17:31:28 +0100
committer: GitHub <noreply@github.com>, 2024-02-18 18:31:28 +0200
commit: e75c6279d1c8e7abb82a331f5de7124eed402de2
tree: 23890d09bc6e25bad33b008ab571a333e0df1537 /examples/server/README.md
parent: 36376abe05a12a8cb3af548a4af9b8d0e2e69597
server : enhanced health endpoint (#5548)
* server: enrich health endpoint with available slots, return 503 if not slots are available
* server: document new status no slot available in the README.md
Diffstat (limited to 'examples/server/README.md')
-rw-r--r-- examples/server/README.md | 1
1 file changed, 1 insertion, 0 deletions
diff --git a/examples/server/README.md b/examples/server/README.md
index fe5cd8d5..5e3ae833 100644
--- a/examples/server/README.md
+++ b/examples/server/README.md
@@ -136,6 +136,7 @@ node index.js
- `{"status": "loading model"}` if the model is still being loaded.
- `{"status": "error"}` if the model failed to load.
- `{"status": "ok"}` if the model is successfully loaded and the server is ready for further requests mentioned below.
+ - `{"status": "no slot available", "slots_idle": 0, "slots_processing": 32}` if no slot are currently available
- **POST** `/completion`: Given a `prompt`, it returns the predicted completion.
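
For illustration, a client could act on the `/health` status values listed in the hunk above roughly as follows. This is a minimal sketch, not code from this commit: the function name `interpret_health` and the retry policy (treating `loading model` and `no slot available` as transient) are assumptions. Only the status strings and the `slots_idle` / `slots_processing` fields come from the README text.

```python
import json


def interpret_health(body: str) -> dict:
    """Classify a /health response body (hypothetical helper, not part of the server).

    Returns a dict with:
      ready  - True only when the server reports "ok"
      retry  - True when the condition looks transient (an assumed policy)
      reason - the raw status string from the response
    """
    payload = json.loads(body)
    status = payload.get("status")

    if status == "ok":
        return {"ready": True, "retry": False, "reason": status}

    if status == "no slot available":
        # All slots are busy; report the counts if the server sent them.
        return {
            "ready": False,
            "retry": True,
            "reason": status,
            "slots_idle": payload.get("slots_idle"),
            "slots_processing": payload.get("slots_processing"),
        }

    if status == "loading model":
        return {"ready": False, "retry": True, "reason": status}

    # "error" or any unrecognized status: assume retrying will not help.
    return {"ready": False, "retry": False, "reason": status}


if __name__ == "__main__":
    sample = '{"status": "no slot available", "slots_idle": 0, "slots_processing": 32}'
    print(interpret_health(sample))
```

The helper takes the response body as a string so it can be used with any HTTP client. Per the commit message, the server answers with HTTP 503 when no slots are available, so a caller may also want to check the status code before parsing the body.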