Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)

* gguf-py: gguf-dump: Respect --no-tensor flag in JSON mode. * Respect add_bos_token GGUF metadata value * gguf-py: Try to fix SpecialVocab giving up too easily for the Nth time
author: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com> 2023-11-16 19:14:37 -0700
committer: GitHub <noreply@github.com> 2023-11-16 19:14:37 -0700
commit: 91f6499393d2d999331fbfdba47a7f8b9f913f0d (patch)
tree: 27caf3ad0b9cec979bb5ed3317b5334bdcd9470c /examples/main
parent: 8da46278e1a57107591653275f8e03a281de94f0 (diff)
1 files changed, 1 insertions, 1 deletions
diff --git a/examples/main/main.cpp b/examples/main/main.cpp
index 8d985c82..99d219d6 100644
--- a/examples/main/main.cpp
+++ b/examples/main/main.cpp
@@ -229,7 +229,7 @@ int main(int argc, char ** argv) {
         }
     }
 
-    const bool add_bos = llama_vocab_type(model) == LLAMA_VOCAB_TYPE_SPM;
+    const bool add_bos = llama_should_add_bos_token(model);
     LOG("add_bos: %d\n", add_bos);
 
     std::vector<llama_token> embd_inp;
author	Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com>	2023-11-16 19:14:37 -0700
committer	GitHub <noreply@github.com>	2023-11-16 19:14:37 -0700
commit	91f6499393d2d999331fbfdba47a7f8b9f913f0d (patch)
tree	27caf3ad0b9cec979bb5ed3317b5334bdcd9470c /examples/main
parent	8da46278e1a57107591653275f8e03a281de94f0 (diff)