summaryrefslogtreecommitdiff
path: root/examples/train-text-from-scratch/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'examples/train-text-from-scratch/README.md')
-rw-r--r--examples/train-text-from-scratch/README.md11
1 files changed, 8 insertions, 3 deletions
diff --git a/examples/train-text-from-scratch/README.md b/examples/train-text-from-scratch/README.md
index f4ffcd98..1b345406 100644
--- a/examples/train-text-from-scratch/README.md
+++ b/examples/train-text-from-scratch/README.md
@@ -10,9 +10,9 @@ wget https://raw.githubusercontent.com/brunoklein99/deep-learning-notes/master/s
./bin/train-text-from-scratch \
--vocab-model ../models/ggml-vocab-llama.gguf \
--ctx 64 --embd 256 --head 8 --layer 16 \
- --checkpoint-in chk-shakespeare-256x16.gguf \
- --checkpoint-out chk-shakespeare-256x16.gguf \
- --model-out ggml-shakespeare-256x16-f32.gguf \
+ --checkpoint-in chk-shakespeare-256x16-LATEST.gguf \
+ --checkpoint-out chk-shakespeare-256x16-ITERATION.gguf \
+ --model-out ggml-shakespeare-256x16-f32-ITERATION.gguf \
--train-data "shakespeare.txt" \
-t 6 -b 16 --seed 1 --adam-iter 256 \
--no-checkpointing
@@ -20,3 +20,8 @@ wget https://raw.githubusercontent.com/brunoklein99/deep-learning-notes/master/s
# predict
./bin/main -m ggml-shakespeare-256x16-f32.gguf
```
+
+Output files will be saved every N iterations (config with `--save-every N`).
+The pattern "ITERATION" in the output filenames will be replaced with the iteration number and "LATEST" for the latest output.
+
+To train GGUF models just pass them to `--checkpoint-in FN`.