path: root/gguf-py/gguf/gguf_writer.py
Age | Commit message | Author
2024-02-15 | Use correct type of pooling for embedding models (#5500) | Douglas Hanley
2024-02-15 | fix(gguf-py): special tokens are no longer skipped when add_<token>_token is ... | Michaël de Vries
2024-02-13 | llama : support batched embeddings (#5466) | Douglas Hanley
2024-02-11 | Add support for BERT embedding models (#5423) | Douglas Hanley
2024-02-01 | llama : support InternLM2 (#5184) | Guoteng
2024-01-02 | llama : differentiate the KV dims in the attention (#4657) | postmasters
2023-12-13 | llama : add Mixtral support (#4406) | slaren
2023-11-20 | ci : add flake8 to github actions (python linting) (#4129) | Galunid
2023-11-19 | gguf-py : export chat templates (#4125) | slaren
2023-11-12 | gguf-py: gguf_writer: Use bytearray to build metadata (#4051) | Kerfuffle
2023-11-11 | gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) | Kerfuffle