diff options
author | Xiao-Yong Jin <jinxiaoyong@gmail.com> | 2023-07-25 07:19:11 -0500 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-07-25 15:19:11 +0300 |
commit | 0c06204fb39aa5560e883e0ae74be9518c57d88e (patch) | |
tree | b2b218adf5dfe353d744d8b46d9f20f7c40d66a6 /examples/common.cpp | |
parent | 1fed755b1fb9babb6dbc1b4023e492950cd5a5be (diff) |
main : add `--in-prefix-bos` to prefix BOS to user inputs; keep EOS (#2304)
* add `--in-prefix-bos` to prefix BOS to user inputs; keep EOS
The BOS precedes the string specified by `--in-prefix`.
Model generated EOS is now kept in the context.
It provides a way to strictly following the prompt format used in
Llama-2-chat.
The EOS handling also benefits some existing finetunes that uses
EOS to mark the end of turn.
* examples/common: move input_prefix_bos to other bools
Diffstat (limited to 'examples/common.cpp')
-rw-r--r-- | examples/common.cpp | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/examples/common.cpp b/examples/common.cpp index 0e88a128..dd964c8a 100644 --- a/examples/common.cpp +++ b/examples/common.cpp @@ -432,6 +432,8 @@ bool gpt_params_parse(int argc, char ** argv, gpt_params & params) { exit(0); } else if (arg == "--random-prompt") { params.random_prompt = true; + } else if (arg == "--in-prefix-bos") { + params.input_prefix_bos = true; } else if (arg == "--in-prefix") { if (++i >= argc) { invalid_param = true; @@ -517,6 +519,7 @@ void gpt_print_usage(int /*argc*/, char ** argv, const gpt_params & params) { fprintf(stdout, " not supported with --interactive or other interactive options\n"); fprintf(stdout, " --prompt-cache-ro if specified, uses the prompt cache but does not update it.\n"); fprintf(stdout, " --random-prompt start with a randomized prompt.\n"); + fprintf(stdout, " --in-prefix-bos prefix BOS to user inputs, preceding the `--in-prefix` string\n"); fprintf(stdout, " --in-prefix STRING string to prefix user inputs with (default: empty)\n"); fprintf(stdout, " --in-suffix STRING string to suffix after user inputs with (default: empty)\n"); fprintf(stdout, " -f FNAME, --file FNAME\n"); |