author | Kawrakow <iwankawrakow@gmail.com> | 2025-02-22 09:38:51 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2025-02-22 09:38:51 +0200 |
commit | c4a5103299e44adc8692e3e373c1974fa9fee270 | |
tree | f0afc8baa6af5e7805835c76c711f0bc58771f73 /src | |
parent | b9a6639ac3bc77c64bba679cb85b14de0c4a9c9d |
Better strategy for attention matrix multiplications when generating tokens (#218)
* This seems to be a better way to do the attention matrix multiplications in the token generation (TG) case.
* Cleanup
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'src')
0 files changed, 0 insertions, 0 deletions