diff options
author | Kawrakow <iwankawrakow@gmail.com> | 2024-12-08 09:13:10 +0100 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-12-08 09:13:10 +0100 |
commit | ef95b81733599429fdd63e4c2fb32c74645046be (patch) | |
tree | 7b01c0969ccb342edb155bce41a47edb343d8ea2 /examples | |
parent | 3682e4700db6b8cb2ca8e3da365578078f21ab0c (diff) |
R4 improvements on ARM_NEON (#125)
* q4_0_r4: 6% faster PP on NEON
* qx_0_r4_q8_0 template
Applied to q4_0_r4 and q5_0_r4. It makes q5_0_r4 PP
~7% faster.
* Apply qx_0_r4_q8_0 template also to q6_0_r4 and iq4_nl_x4
* Simplify
* Minor iq4_xs_r4 improvement on NEON
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples')
0 files changed, 0 insertions, 0 deletions