author: Kawrakow <iwankawrakow@gmail.com> 2024-09-28 13:37:25 +0300
committer: GitHub <noreply@github.com> 2024-09-28 13:37:25 +0300
commit: 737514fd814d944f8ce965620293a16e5e8a285d
tree: 4b4b79eec0d1cbcc413dd3c6991b6d57439edd86 /ggml/src/ggml-impl.h
parent: 1f61e91862dd0b077ccb60459f3cc03f364ee279
Adding SWIGLU unary op (#65)
* Adding GGML_UNARY_OP_SWIGLU

  This commit implements the ggml op and CPU compute forward.
  I see ~3-4% speedup of PP-512 for Phi-3.5-mini.

* GGML_UNARY_OP_SWIGLU: CUDA implementation

  I observe ~12% speedup for PP-512 (Phi-3.5-mini).

* GGML_UNARY_OP_SWIGLU: Metal implementation

  We get ~2% speedup for PP-512 (Phi-3.5-mini).

* GGML_UNARY_OP_SWIGLU: minor improvement on Metal

* GGML_UNARY_OP_SWIGLU: cleanup

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/ggml-impl.h')
0 files changed, 0 insertions, 0 deletions