diff options
author | Georgi Gerganov <ggerganov@gmail.com> | 2023-08-27 14:44:35 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-08-27 14:44:35 +0300 |
commit | c48c5bb0b06385f6c708339188d2aaf2bc278477 (patch) | |
tree | 660b0bc9fcda81b886108a698f7b7a1697c0c8a2 | |
parent | d0cee0d36d5be95a0d9088b674dbb27354107221 (diff) |
readme : update hot topics
-rw-r--r-- | README.md | 4 |
1 files changed, 4 insertions, 0 deletions
@@ -11,6 +11,10 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++ ### Hot topics +- ## IMPORTANT: Tokenizer fixes and API change (developers and projects using `llama.cpp` built-in tokenization must read): https://github.com/ggerganov/llama.cpp/pull/2810 + +- ## GGUFv2 adds support for 64-bit sizes + backwards compatible: https://github.com/ggerganov/llama.cpp/pull/2821 + - Added support for Falcon models: https://github.com/ggerganov/llama.cpp/pull/2717 - A new file format has been introduced: [GGUF](https://github.com/ggerganov/llama.cpp/pull/2398) |