Age | Commit message | Author |
---|---|---|
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow |
2024-05-25 | gguf-py : fix and simplify quantized shape round-trip (#7483) | compilade |
2024-05-08 | convert-hf : save memory with lazy evaluation (#7075) | compilade |
2024-05-03 | convert.py : add python logging instead of print() (#6511) | Brian |
2024-04-29 | llama : fix BPE pre-tokenization (#6920) | Georgi Gerganov |
2024-04-28 | gguf : enforce that tensor names are unique (#6905) | Xuan Son Nguyen |
2024-03-15 | gguf : add support for I64 and F64 arrays (#6062) | Ondřej Čertík |
2024-03-14 | gguf-py : add support for I8, I16 and I32 (#6045) | Ondřej Čertík |
2024-01-26 | gguf : fix "general.alignment" type in gguf_reader.py (#5136) | Riceball LEE |
2023-11-11 | gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) | Kerfuffle |