Age | Commit message (Expand) | Author |
---|---|---|
2025-06-08 | Fix non rpc build error (#506) | firecoperana |
2025-06-08 | Revert "Rpc improvement (#480)" | Iwan Kawrakow |
2025-06-08 | Rpc improvement (#480) | firecoperana |
2025-05-20 | Bug fixes from mainline (#439) | Kawrakow |
2025-05-12 | GPU offload policy (#405) | Kawrakow |
2025-03-13 | FlashMLA-2 (CPU): faster and smaller compute buffer size (#253) | Kawrakow |
2025-02-25 | Give the user the option to override where model weights are stored (#232) | Kawrakow |
2024-10-25 | Bitnet changes (#106) | Kawrakow |
2024-10-20 | Avoid rebuild of GGML graph for each token (#98) | agray3 |
2024-08-12 | Merge mainline - Aug 12 2024 (#17) | Kawrakow |
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow |