diff options
| author | Kawrakow <iwankawrakow@gmail.com> | 2025-02-22 09:38:51 +0200 | 
|---|---|---|
| committer | GitHub <noreply@github.com> | 2025-02-22 09:38:51 +0200 | 
| commit | c4a5103299e44adc8692e3e373c1974fa9fee270 (patch) | |
| tree | f0afc8baa6af5e7805835c76c711f0bc58771f73 /ggml/src/ggml-sycl/conv.hpp | |
| parent | b9a6639ac3bc77c64bba679cb85b14de0c4a9c9d (diff) | |
Better strategy for attention matrix multiplications when generating tokens  (#218)
* This seems to be a better way
to do the attention matrix multiplications in the TG case.
* Cleanup
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-sycl/conv.hpp')
0 files changed, 0 insertions, 0 deletions
