summaryrefslogtreecommitdiff
path: root/gguf-py
AgeCommit message (Expand)Author
2024-01-13convert : update phi-2 to latest HF repo (#4903)Georgi Gerganov
2024-01-12llama : fix llm_build_k_shift to use correct n_rot (#4889)Georgi Gerganov
2024-01-02llama : differentiate the KV dims in the attention (#4657)postmasters
2023-12-28gpt2 : Add gpt2 architecture integration (#4555)manikbhandari
2023-12-27llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)Nam D. Tran
2023-12-24llama : add PLaMo model (#3557)Shintarou Okada
2023-12-21gguf-py : fix broken linkGeorgi Gerganov
2023-12-21py : open merges file as 'utf-8' (#4566)howlger
2023-12-18llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)Ebey Abraham
2023-12-17gguf-py : fail fast on nonsensical special token IDs (#4489)Jared Van Bortel
2023-12-13llama : add Mixtral support (#4406)slaren
2023-12-12english : use `typos` to fix comments and logs (#4354)Richard Kiss
2023-12-01llama : add Qwen support (#4281)Shijie
2023-11-20ci : add flake8 to github actions (python linting) (#4129)Galunid
2023-11-19gguf-py : export chat templates (#4125)slaren
2023-11-16Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)Kerfuffle
2023-11-14stablelm : StableLM support (#3586)Galunid
2023-11-12gguf-py: gguf_writer: Use bytearray to build metadata (#4051)Kerfuffle
2023-11-11Fix gguf-convert-endian script (#4037)M. Yusuf Sarıgöz
2023-11-11gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)Kerfuffle
2023-11-07gguf : track writer state, free unneeded tensors, cleanup (#3871)Jared Van Bortel
2023-11-04gguf-py: Support 01.AI Yi models (#3943)Kerfuffle
2023-11-01llama : implement YaRN RoPE scaling (#2268)cebtenzzre
2023-10-22llama : validate special token ids are in range when loading GGUF model (#3635)Kerfuffle
2023-10-20gguf : support big endian platform (#3552)Qin Yue Chen
2023-10-10llm : add bloom models (#3553)Xingchen Song(宋星辰)
2023-10-07gguf.py : fix CI for publishing GGUF package (#3532)M. Yusuf Sarıgöz
2023-10-07llm : support Adept Persimmon 8B (#3410)Phillip Kravtsov
2023-10-04llm : add Refact model (#3329)ds5t5
2023-10-02gguf : add BERT, MPT, and GPT-J arch info (#3408)cebtenzzre
2023-10-02gguf : general usability improvements (#3409)cebtenzzre
2023-09-27gguf : fix a few general keys (#3341)Cebtenzzre
2023-09-15llama : add support for StarCoder model architectures (#3187)Meng Zhang
2023-09-14gguf-py : support identity operation in TensorNameMap (#3095)Kerfuffle
2023-09-14feature : support Baichuan serial models (#3009)jameswu2014
2023-09-03gguf(python): Fix special vocab handling when id < 0 (#2984)Kerfuffle
2023-08-31convert : fix python 3.8 support, modernize type annotations (#2916)Cebtenzzre
2023-08-30gguf : add workflow for Pypi publishing (#2896)M. Yusuf Sarıgöz
2023-08-30convert : various script cleanups/fixes + merges and special token handling (...Kerfuffle
2023-08-27gguf : add 64-bit support (GGUF v2) (#2821)Georgi Gerganov
2023-08-25gguf : export objects to user code (#2780)M. Yusuf Sarıgöz
2023-08-25gguf : make gguf pip-installableM. Yusuf Sarıgöz