diff options
author | Kawrakow <48489457+ikawrakow@users.noreply.github.com> | 2023-09-03 11:06:22 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-09-03 11:06:22 +0300 |
commit | ca82cf7bac0c91d03e3d320b3a865dd006f854ac (patch) | |
tree | 02b91ac7d85eba9234fb0d0d4152218909135bcb /ggml-opencl.cpp | |
parent | 6a31a3bd9806c85ed08266f6ab65181da0f30d03 (diff) |
metal : more optimizations (#2959)
* Very minor speedup via simd-group synchronization in f16 x f32
* Another very minor speedup on metal
* Quite significant PP speedup on metal
* Another attempt
* Minor
* Massive improvement for TG for fp16
* ~4-5% improvement for Q8_0 TG on metal
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'ggml-opencl.cpp')
0 files changed, 0 insertions, 0 deletions