summaryrefslogtreecommitdiff
path: root/tests/test-backend-ops.cpp
diff options
context:
space:
mode:
authorIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-20 19:23:10 +0300
committerIwan Kawrakow <iwan.kawrakow@gmail.com>2024-06-22 12:02:52 +0300
commit729ba46f774a4ba9af48ce6708da653ee80d2296 (patch)
tree1a82e24c139d77a8879d85bc8bbb9e3ec8f18ccd /tests/test-backend-ops.cpp
parentf0325c5826c55bb9796485d49bc971a17735e96a (diff)
bitnet(scale in a separate tensor): CPU tweaks
I had ruined TG performance on AVX2 with the last commit. Was just testing at 8 threads and there we are totally memory bound. But at 4 threads we had regressed to 41 t/s on the Ryzen7950. Back to 51 t/s with this commit.
Diffstat (limited to 'tests/test-backend-ops.cpp')
0 files changed, 0 insertions, 0 deletions