Age | Commit message (Expand) | Author |
---|---|---|
2023-10-11 | common : fix mirostat state when using multiple sequences (#3543) | Kerfuffle |
2023-10-03 | llama : fix session saving/loading (#3400) | Georgi Gerganov |
2023-09-28 | llama.cpp : split llama_context_params into model and context params (#3301) | slaren |
2023-09-28 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | Georgi Gerganov |
2023-09-14 | speculative : add heuristic algorithm (#3006) | Leng Yue |
2023-09-13 | speculative: add --n-gpu-layers-draft option (#3063) | FK |
2023-09-08 | build : do not use _GNU_SOURCE gratuitously (#2035) | Przemysław Pawełczyk |
2023-09-05 | speculative : add grammar support (#2991) | Georgi Gerganov |
2023-09-03 | speculative : PoC for speeding-up inference via speculative sampling (#2926) | Georgi Gerganov |