author | Kawrakow <iwankawrakow@gmail.com> | 2025-02-27 16:40:49 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-02-27 16:40:49 +0200 |
commit | b762db7c9264199c2d0f66e7d63e3b4884f3fc0c (patch) | |
tree | 01cc16988a4d21b4c1df367df23f4fd53e6b58a0 /examples/batched | |
parent | 51029edfdf286df76f9268fc87b9514291b2fe42 (diff) |
Option to use MLA without a transposed cache (#235)
The `-mla` command-line option changes from a bool to an int:
mla = 0: use standard attention
mla = 1: use MLA with transposed cache
mla > 1: use MLA without transposed cache
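As a rough illustration of the three cases above, the integer could be mapped to an attention mode as sketched below. The enum `AttnMode` and the function `attn_mode_from_mla` are hypothetical names chosen for this example and are not taken from the commit's diff. The message does not say how negative values are handled, so treating them as standard attention is an assumption.

```cpp
#include <string>

// Hypothetical type for illustration only; not part of the actual code base.
enum class AttnMode {
    Standard,              // mla = 0
    MlaTransposedCache,    // mla = 1
    MlaNoTransposedCache   // mla > 1
};

// Map the integer value of -mla to the cases listed in the commit message.
// Assumption: negative values fall back to standard attention.
AttnMode attn_mode_from_mla(int mla) {
    if (mla <= 0) return AttnMode::Standard;
    if (mla == 1) return AttnMode::MlaTransposedCache;
    return AttnMode::MlaNoTransposedCache;
}

// Human-readable description of each mode, using the commit message's wording.
std::string describe(AttnMode mode) {
    switch (mode) {
        case AttnMode::Standard:             return "standard attention";
        case AttnMode::MlaTransposedCache:   return "MLA with transposed cache";
        case AttnMode::MlaNoTransposedCache: return "MLA without transposed cache";
    }
    return "unknown";
}
```

With this sketch, `-mla 2` and `-mla 3` select the same mode, since the message only distinguishes values greater than 1.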
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples/batched')
0 files changed, 0 insertions, 0 deletions