summaryrefslogtreecommitdiff
path: root/gguf-py/gguf/tensor_mapping.py
diff options
context:
space:
mode:
authorKawrakow <iwankawrakow@gmail.com>2025-04-17 08:08:40 +0200
committerGitHub <noreply@github.com>2025-04-17 08:08:40 +0200
commit3bb64d9330d5336d76b036535474d8a4b273373c (patch)
tree3724f7c8abc20b467b756f8a498be7c619831a68 /gguf-py/gguf/tensor_mapping.py
parentf7c5a94e756e4add4d531d295ae23493d9857508 (diff)
Better TG performance for GQA models (CPU) (#332)
* Slightly better CPU TG performance for GQA * Better CPU FA implementation for TG when GQA * Minor --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'gguf-py/gguf/tensor_mapping.py')
0 files changed, 0 insertions, 0 deletions