index : ik_llama.cpp.git (main)
path: root/gguf-py
Age
Commit message
Author
2024-01-13
convert : update phi-2 to latest HF repo (#4903)
Georgi Gerganov
2024-01-12
llama : fix llm_build_k_shift to use correct n_rot (#4889)
Georgi Gerganov
2024-01-02
llama : differentiate the KV dims in the attention (#4657)
postmasters
2023-12-28
gpt2 : Add gpt2 architecture integration (#4555)
manikbhandari
2023-12-27
llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)
Nam D. Tran
2023-12-24
llama : add PLaMo model (#3557)
Shintarou Okada
2023-12-21
gguf-py : fix broken link
Georgi Gerganov
2023-12-21
py : open merges file as 'utf-8' (#4566)
howlger
2023-12-18
llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)
Ebey Abraham
2023-12-17
gguf-py : fail fast on nonsensical special token IDs (#4489)
Jared Van Bortel
2023-12-13
llama : add Mixtral support (#4406)
slaren
2023-12-12
english : use `typos` to fix comments and logs (#4354)
Richard Kiss
2023-12-01
llama : add Qwen support (#4281)
Shijie
2023-11-20
ci : add flake8 to github actions (python linting) (#4129)
Galunid
2023-11-19
gguf-py : export chat templates (#4125)
slaren
2023-11-16
Respect tokenizer.ggml.add_bos_token value when tokenizing (#4040)
Kerfuffle
2023-11-14
stablelm : StableLM support (#3586)
Galunid
2023-11-12
gguf-py: gguf_writer: Use bytearray to build metadata (#4051)
Kerfuffle
2023-11-11
Fix gguf-convert-endian script (#4037)
M. Yusuf Sarıgöz
2023-11-11
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
Kerfuffle
2023-11-07
gguf : track writer state, free unneeded tensors, cleanup (#3871)
Jared Van Bortel
2023-11-04
gguf-py: Support 01.AI Yi models (#3943)
Kerfuffle
2023-11-01
llama : implement YaRN RoPE scaling (#2268)
cebtenzzre
2023-10-22
llama : validate special token ids are in range when loading GGUF model (#3635)
Kerfuffle
2023-10-20
gguf : support big endian platform (#3552)
Qin Yue Chen
2023-10-10
llm : add bloom models (#3553)
Xingchen Song(宋星辰)
2023-10-07
gguf.py : fix CI for publishing GGUF package (#3532)
M. Yusuf Sarıgöz
2023-10-07
llm : support Adept Persimmon 8B (#3410)
Phillip Kravtsov
2023-10-04
llm : add Refact model (#3329)
ds5t5
2023-10-02
gguf : add BERT, MPT, and GPT-J arch info (#3408)
cebtenzzre
2023-10-02
gguf : general usability improvements (#3409)
cebtenzzre
2023-09-27
gguf : fix a few general keys (#3341)
Cebtenzzre
2023-09-15
llama : add support for StarCoder model architectures (#3187)
Meng Zhang
2023-09-14
gguf-py : support identity operation in TensorNameMap (#3095)
Kerfuffle
2023-09-14
feature : support Baichuan serial models (#3009)
jameswu2014
2023-09-03
gguf(python): Fix special vocab handling when id < 0 (#2984)
Kerfuffle
2023-08-31
convert : fix python 3.8 support, modernize type annotations (#2916)
Cebtenzzre
2023-08-30
gguf : add workflow for Pypi publishing (#2896)
M. Yusuf Sarıgöz
2023-08-30
convert : various script cleanups/fixes + merges and special token handling (...
Kerfuffle
2023-08-27
gguf : add 64-bit support (GGUF v2) (#2821)
Georgi Gerganov
2023-08-25
gguf : export objects to user code (#2780)
M. Yusuf Sarıgöz
2023-08-25
gguf : make gguf pip-installable
M. Yusuf Sarıgöz