index : ik_llama.cpp.git (main)
path: root/gguf-py/gguf/constants.py
Age | Commit message | Author
2024-02-15 | Use correct type of pooling for embedding models (#5500) | Douglas Hanley
2024-02-15 | fix(gguf-py): special tokens are no longer skipped when add_<token>_token is ... | Michaël de Vries
2024-02-13 | llama : add support for Nomic Embed (#5468) | Jared Van Bortel
2024-02-13 | llama : support batched embeddings (#5466) | Douglas Hanley
2024-02-11 | Add support for BERT embedding models (#5423) | Douglas Hanley
2024-02-07 | llama : add MiniCPM support (#5346) | runfuture
2024-02-01 | llama : support InternLM2 (#5184) | Guoteng
2024-01-28 | llama : add support for Orion-14B (#5118) | sharpHL
2024-01-19 | llama : support upcoming Qwen2 (#5037) | Shijie
2024-01-19 | llama : add CodeShell support (#5016) | chiranko
2024-01-13 | convert : update phi-2 to latest HF repo (#4903) | Georgi Gerganov
2024-01-02 | llama : differentiate the KV dims in the attention (#4657) | postmasters
2023-12-28 | gpt2 : Add gpt2 architecture integration (#4555) | manikbhandari
2023-12-27 | llama : add AWQ for llama, llama2, mpt, and mistral models (#4593) | Nam D. Tran
2023-12-24 | llama : add PLaMo model (#3557) | Shintarou Okada
2023-12-18 | llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490) | Ebey Abraham
2023-12-13 | llama : add Mixtral support (#4406) | slaren
2023-12-01 | llama : add Qwen support (#4281) | Shijie
2023-11-19 | gguf-py : export chat templates (#4125) | slaren
2023-11-14 | stablelm : StableLM support (#3586) | Galunid
2023-11-11 | gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) | Kerfuffle