Diffstat (limited to 'examples/embedding/README.md')
-rw-r--r-- examples/embedding/README.md 39
1 files changed, 39 insertions, 0 deletions
diff --git a/examples/embedding/README.md b/examples/embedding/README.md
index 2298ec3e..e3705b45 100644
--- a/examples/embedding/README.md
+++ b/examples/embedding/README.md
@@ -19,3 +19,42 @@ llama-embedding.exe -m ./path/to/model --log-disable -p "Hello World!" 2>$null
```
The above command will output space-separated float values.
+
+## extra parameters
+### --embd-normalize $integer$
+| $integer$ | description | formula |
+|-----------|---------------------|---------|
+| $-1$      | none                | |
+| $0$       | max absolute int16  | $\Large{{32760 * x_i} \over\max \lvert x_i\rvert}$ |
+| $1$       | taxicab             | $\Large{x_i \over\sum \lvert x_i\rvert}$ |
+| $2$       | euclidean (default) | $\Large{x_i \over\sqrt{\sum x_i^2}}$ |
+| $>2$      | p-norm, with $p$ = $integer$ | $\Large{x_i \over\sqrt[p]{\sum \lvert x_i\rvert^p}}$ |
+
+### --embd-output-format $'string'$
+| $'string'$ | description | |
+|------------|------------------------------|--|
+| ''         | unchanged from previous behavior | (default)
+| 'array' | single embeddings | $[[x_1,...,x_n]]$
+| | multiple embeddings | $[[x_1,...,x_n],[x_1,...,x_n],...,[x_1,...,x_n]]$
+| 'json' | openai style |
+| 'json+' | add cosine similarity matrix |
+
+### --embd-separator $"string"$
+| $"string"$ | |
+|--------------|-|
+| "\n" | (default)
+| "<#embSep#>" | for example
+| "<#sep#>"    | another example
+
+## examples
+### Unix-based systems (Linux, macOS, etc.):
+
+```bash
+./llama-embedding -p 'Castle<#sep#>Stronghold<#sep#>Dog<#sep#>Cat' --embd-separator '<#sep#>' --embd-normalize 2 --embd-output-format '' -m './path/to/model.gguf' --n-gpu-layers 99 --log-disable 2>/dev/null
+```
+
+### Windows:
+
+```powershell
+llama-embedding.exe -p 'Castle<#sep#>Stronghold<#sep#>Dog<#sep#>Cat' --embd-separator '<#sep#>' --embd-normalize 2 --embd-output-format '' -m './path/to/model.gguf' --n-gpu-layers 99 --log-disable 2>$null
+```
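
Reviewer note (not part of the patch): the `--embd-normalize` formulas in the table above can be checked with a small Python sketch. It follows the table only and is not taken from the tool's source. The function names `embd_normalize` and `cosine_similarity` are hypothetical. Treating every negative value like `-1` and returning an all-zero vector unchanged are assumptions, since the README does not specify either case.

```python
import math


def embd_normalize(x, mode=2):
    """Normalize one embedding according to the README table (sketch only)."""
    if mode < 0:
        # -1 means "none"; other negative values are ASSUMED to behave the same.
        return list(x)
    if mode == 0:
        # max absolute int16: 32760 * x_i / max|x_i|
        denom = max(abs(v) for v in x) / 32760.0
    elif mode == 1:
        # taxicab: x_i / sum|x_i|
        denom = sum(abs(v) for v in x)
    elif mode == 2:
        # euclidean: x_i / sqrt(sum x_i^2)
        denom = math.sqrt(sum(v * v for v in x))
    else:
        # p-norm with p = mode: x_i / (sum|x_i|^p)^(1/p)
        denom = sum(abs(v) ** mode for v in x) ** (1.0 / mode)
    if denom == 0:
        # ASSUMPTION: leave an all-zero vector untouched instead of dividing by 0.
        return list(x)
    return [v / denom for v in x]


def cosine_similarity(a, b):
    """Cosine similarity as the dot product of two euclidean-normalized vectors."""
    na = embd_normalize(a, 2)
    nb = embd_normalize(b, 2)
    return sum(p * q for p, q in zip(na, nb))
```

With euclidean normalization (the default), the cosine similarity between two embeddings reduces to a plain dot product, which is presumably what makes a similarity matrix such as the one mentioned for `'json+'` cheap to compute.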