summaryrefslogtreecommitdiff
path: root/examples/gguf-split/README.md
diff options
context:
space:
mode:
authorPierrick Hymbert <pierrick.hymbert@gmail.com>2024-03-19 12:05:44 +0100
committerGitHub <noreply@github.com>2024-03-19 12:05:44 +0100
commitd0d5de42e5a65865b5fddb6f5c785083539b74c3 (patch)
tree06e1d97ac44314f39c1b40358c2838b46bed71d9 /examples/gguf-split/README.md
parentb80cf3b2d1dee0ad325f7a794fecc66befce7336 (diff)
gguf-split: split and merge gguf per batch of tensors (#6135)
* gguf-split: split and merge gguf files per tensor * gguf-split: build with make toolchain * gguf-split: rename `--split-tensors-size` to `--split-max-tensors`. Set general.split_count KV to all split * split : minor style + fix compile warnings * gguf-split: remove --upload not implemented --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'examples/gguf-split/README.md')
-rw-r--r--examples/gguf-split/README.md9
1 files changed, 9 insertions, 0 deletions
diff --git a/examples/gguf-split/README.md b/examples/gguf-split/README.md
new file mode 100644
index 00000000..ddb1f764
--- /dev/null
+++ b/examples/gguf-split/README.md
@@ -0,0 +1,9 @@
+## GGUF split Example
+
+CLI to split / merge GGUF files.
+
+**Command line options:**
+
+- `--split`: split GGUF to multiple GGUF, default operation.
+- `--split-max-tensors`: maximum tensors in each split: default(128)
+- `--merge`: merge multiple GGUF to a single GGUF.