summaryrefslogtreecommitdiff
path: root/examples/pydantic_models_to_grammar.py
diff options
context:
space:
mode:
authorKawrakow <48489457+ikawrakow@users.noreply.github.com>2024-09-01 16:08:21 +0300
committerGitHub <noreply@github.com>2024-09-01 16:08:21 +0300
commitdc023bc3be1a7ac42d1030f86c4d77563a019286 (patch)
tree565cc8a7be7d54ac164c0e7efc23b9dadf06cd92 /examples/pydantic_models_to_grammar.py
parentdbb1db989991025881679a60b0a81a92d2fa471b (diff)
Zen4 Flash Attention (#32)
* Zen4 flash attention: moving useful parts from the kq_fused_softmax branch * Add flash attention with soft-cap and fix D = 256 case * Flash attention refinements * Update FlashAttn comment --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'examples/pydantic_models_to_grammar.py')
0 files changed, 0 insertions, 0 deletions