summaryrefslogtreecommitdiff
path: root/ggml/src/ggml-cuda/template-instances/generate_cu_files.py
AgeCommit message (Expand)Author
2024-11-21MMQ for Q6_0 (#115)Kawrakow
2024-10-22Enable q6_0 for flash attention (#101)Kawrakow
2024-10-21Enable IQ4_NL for KV-cache in token generation using Flash Attention (#99)Kawrakow
2024-07-27Merge mainline llama.cpp (#3)Kawrakow