Age | Commit message | Author |
---|---|---|
2024-06-05 | ggml : refactor rope norm/neox (#7634) | Georgi Gerganov |
2024-05-23 | ggml : remove ggml_flash_attn and ggml_flash_ff (#7463) | Georgi Gerganov |
2023-12-24 | cuda : improve cuda pool efficiency using virtual memory (#4606) | slaren |
2023-12-21 | ggml : change ggml_scale to take a float instead of tensor (#4573) | Georgi Gerganov |
2023-12-12 | english : use `typos` to fix comments and logs (#4354) | Richard Kiss |
2023-11-13 | sync : ggml (backend v2) (#3912) | Georgi Gerganov |
2023-10-04 | sync : ggml (conv 1d + 2d updates, UB fixes) (#3468) | Georgi Gerganov |
2023-09-28 | build : enable more non-default compiler warnings (#3200) | Cebtenzzre |
2023-09-28 | train : finetune LORA (#2632) | xaedes |
2023-09-28 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | Georgi Gerganov |
2023-08-28 | train : mem usage and other improvements (#2439) | xaedes |
2023-08-02 | tests : Fix compilation warnings (Linux/GCC) (#2451) | Eve |