summaryrefslogtreecommitdiff
path: root/ggml-opencl.cpp
diff options
context:
space:
mode:
authorKawrakow <48489457+ikawrakow@users.noreply.github.com>2023-09-03 11:06:22 +0300
committerGitHub <noreply@github.com>2023-09-03 11:06:22 +0300
commitca82cf7bac0c91d03e3d320b3a865dd006f854ac (patch)
tree02b91ac7d85eba9234fb0d0d4152218909135bcb /ggml-opencl.cpp
parent6a31a3bd9806c85ed08266f6ab65181da0f30d03 (diff)
metal : more optimizations (#2959)
* Very minor speedup via simd-group synchronization in f16 x f32 * Another very minor speedup on metal * Quite significant PP speedup on metal * Another attempt * Minor * Massive improvement for TG for fp16 * ~4-5% improvement for Q8_0 TG on metal --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'ggml-opencl.cpp')
0 files changed, 0 insertions, 0 deletions