author Douglas Hanley <thesecretaryofwar@gmail.com> 2024-02-28 02:51:11 -0600
committer GitHub <noreply@github.com> 2024-02-28 10:51:11 +0200
commit 177628bfd85565070916ad66a5ac4071ee0527d8 (patch)
tree 1532ad96e287a0d8bff4aef92bf2e04eabecec9e /examples/server
parent 6c4416868df2e5455da7d20547f62bcf9735ba8e (diff)
llama : improve BERT tokenization (#5740)
* implement nfd for stripping accents in wpm tokenizer
* sort nfd map; reuse iterator
* use builtin tolower
* add locale include
* Simplify to_lower cases

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>

---------

Co-authored-by: Jared Van Bortel <cebtenzzre@gmail.com>
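
The steps named in the message (NFD-based accent stripping through a sorted map, then lowercasing) can be illustrated with a rough sketch. This is not the code from the commit: the table `k_nfd_base` and the helpers `strip_accent`, `to_lower_ascii`, and `normalize` are hypothetical names, the table holds only five entries, and lowercasing is limited to ASCII for simplicity.

```cpp
#include <algorithm>
#include <cctype>
#include <cstdint>
#include <iterator>
#include <utility>
#include <vector>

// Hypothetical excerpt of an NFD table: precomposed code point -> base code point.
// Entries must stay sorted by the first element so binary search is valid.
static const std::pair<uint32_t, uint32_t> k_nfd_base[] = {
    {0x00C9, 0x0045}, // É -> E
    {0x00E0, 0x0061}, // à -> a
    {0x00E9, 0x0065}, // é -> e
    {0x00F1, 0x006E}, // ñ -> n
    {0x00FC, 0x0075}, // ü -> u
};

// Return the base character for a precomposed code point,
// or the input unchanged when the table has no entry for it.
static uint32_t strip_accent(uint32_t cp) {
    const auto * first = std::begin(k_nfd_base);
    const auto * last  = std::end(k_nfd_base);
    const auto * it = std::lower_bound(first, last, cp,
        [](const std::pair<uint32_t, uint32_t> & entry, uint32_t value) {
            return entry.first < value;
        });
    return (it != last && it->first == cp) ? it->second : cp;
}

// Lowercase ASCII code points with the builtin tolower; leave everything else alone.
// (Assumption: a real tokenizer would need to handle non-ASCII case folding too.)
static uint32_t to_lower_ascii(uint32_t cp) {
    if (cp < 0x80u) {
        return static_cast<uint32_t>(std::tolower(static_cast<int>(cp)));
    }
    return cp;
}

// Strip accents first, then lowercase, one code point at a time.
static std::vector<uint32_t> normalize(const std::vector<uint32_t> & cps) {
    std::vector<uint32_t> out;
    out.reserve(cps.size());
    for (uint32_t cp : cps) {
        out.push_back(to_lower_ascii(strip_accent(cp)));
    }
    return out;
}
```

Keeping the table sorted lets `std::lower_bound` find an entry in logarithmic time without building a hash map at startup, which is one plausible reading of "sort nfd map".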
Diffstat (limited to 'examples/server')
0 files changed, 0 insertions, 0 deletions