| Date | Commit message | Author |
|---|---|---|
| 2024-04-18 | ggml : group all experts in a single ggml_mul_mat_id (#6505) | slaren |
| 2024-04-09 | llama : add Command R Plus support (#6491) | Carolinabanana |
| 2024-04-03 | ggml : mul_mat_id use the same tensor for all the experts (#6387) | slaren |
| 2024-03-29 | sync : ggml (#6351) | Georgi Gerganov |
| 2024-03-26 | IQ1_M: 1.75 bpw quantization (#6302) | Kawrakow |
| 2024-03-25 | cuda : fix LLAMA_CUDA_F16 build (#6298) | slaren |
| 2024-03-25 | cuda : refactor into multiple files (#6269) | slaren |
