diff options
author | Kawrakow <iwankawrakow@gmail.com> | 2025-05-12 07:49:51 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-05-12 07:49:51 +0300 |
commit | f27cd405422307e02dffa8949ac30bc56b4d2900 (patch) | |
tree | 722b742827684815ca2cc0fb6379edd4edd2f3fd /examples | |
parent | 465569dff8b49a195450a0eb1974fd72a32fcebc (diff) |
Enable faster prompt processing with mainline llama.cpp GGUFs (#409)
* Enable MLA-3 in crippled GGUFs: WIP
* Enable MLA-3 in crippled GGUFs: seems to work
* Add newly created tensors to model.tensors_by_name
Else they don't get run-time repacked.
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions