Age | Commit message | Author |
---|---|---|
2024-03-12 | ggml : reuse quantum structs across backends (#5943) | Georgi Gerganov |
2024-03-11 | 1.5 bit: we can do even better (#5999) | Kawrakow |
2024-03-11 | Better 1.5 bit quantization (#5971) | Kawrakow |
2024-03-10 | ggml : remove __constant__ specifier for CUDA tables (#5940) | Georgi Gerganov |
2024-03-09 | ggml : add ggml-common.h to deduplicate shared code (#5940) | Georgi Gerganov |