diff options
author | Kawrakow <48489457+ikawrakow@users.noreply.github.com> | 2024-09-01 16:08:21 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-09-01 16:08:21 +0300 |
commit | dc023bc3be1a7ac42d1030f86c4d77563a019286 (patch) | |
tree | 565cc8a7be7d54ac164c0e7efc23b9dadf06cd92 /examples/pydantic_models_to_grammar.py | |
parent | dbb1db989991025881679a60b0a81a92d2fa471b (diff) |
Zen4 Flash Attention (#32)
* Zen4 flash attention: moving useful parts from the kq_fused_softmax branch
* Add flash attention with soft-cap and fix D = 256 case
* Flash attention refinements
* Update FlashAttn comment
---------
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples/pydantic_models_to_grammar.py')
0 files changed, 0 insertions, 0 deletions