Age | Commit message (Expand) | Author |
---|---|---|
2024-11-21 | MMQ for Q6_0 (#115) | Kawrakow |
2024-10-22 | Enable q6_0 for flash attention (#101) | Kawrakow |
2024-10-21 | Enable IQ4_NL for KV-cache in token generation using Flash Attention (#99) | Kawrakow |
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow |