diff options
author | Georgi Gerganov <ggerganov@gmail.com> | 2024-01-08 20:25:17 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-08 20:25:17 +0200 |
commit | a9a8c5de3d2028701c239d821b220214fcaefbf1 (patch) | |
tree | cf1b74df755d545ea433a2a66191acfa1bc21662 | |
parent | dd5ae06405c5565b99889bdb3f168f4351252cfb (diff) |
readme : add link to SOTA models
-rw-r--r-- | README.md | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -10,6 +10,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++ ### Hot topics +- New SOTA quantized models, including pure 2-bits: https://huggingface.co/ikawrakow - Collecting Apple Silicon performance stats: - M-series: https://github.com/ggerganov/llama.cpp/discussions/4167 - A-series: https://github.com/ggerganov/llama.cpp/discussions/4508 |