author | Georgi Gerganov <ggerganov@gmail.com> | 2023-12-13 14:05:38 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-12-13 14:05:38 +0200 |
commit | 113f9942fc73a262c85e9dcf7c2ea7336250bba0 | |
tree | a7fd59100e07b3bb7105e6d7a3d86682513b7f81 | |
parent | 799a1cb13b0b1b560ab0ceff485caed68faa8f1f | |
readme : update hot topics
-rw-r--r-- | README.md | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -10,6 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
 
 ### Hot topics
 
+- Added Mixtral support: https://github.com/ggerganov/llama.cpp/pull/4406
 - **llama.h API change for handling KV cache offloading and data type: https://github.com/ggerganov/llama.cpp/pull/4309**
 - Using `llama.cpp` with AWS instances: https://github.com/ggerganov/llama.cpp/discussions/4225
 - Looking for contributions to improve and maintain the `server` example: https://github.com/ggerganov/llama.cpp/issues/4216