Age | Commit message (Expand) | Author |
---|---|---|
2023-08-22 | CUDA: use mul_mat_q kernels by default (#2683) | Johannes Gäßler |
2023-08-22 | ggml-cuda : use graph allocator (#2684) | slaren |
2023-08-21 | gguf : new file format with flexible meta data (beta) (#2398) | Georgi Gerganov |