index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
gguf-py
/
gguf
/
tensor_mapping.py
Age
Commit message (
Expand
)
Author
2025-07-10
Support for dots.llm1 models (#573)
saood06
2025-05-09
Fix missing rope_freqs with convert_hf_to_gguf (#402)
saood06
2025-05-09
Support for Llama-3-Nemotron models (#377)
saood06
2025-04-22
Add support for bitnet2b_2501 model (#337)
saood06
2025-02-09
Add optional MLA (#188)
Kawrakow
2025-01-23
Deepseek V3 support added (#176)
saood06
2024-07-27
Merge mainline llama.cpp (#3)
Kawrakow
2024-06-22
bitnet: python + llama
Iwan Kawrakow
2024-06-06
llama : add jina v2 base code (#7596)
Joan Fontanals
2024-05-28
Add support for DeepseekV2ForCausalLM (#7519)
fairydreaming
2024-05-24
Add support for ArcticForCausalLM (#7020)
fairydreaming
2024-05-11
llama : add Jina Embeddings architecture (#6826)
Joan Fontanals
2024-05-11
ggml : full ALiBi support (#7192)
Georgi Gerganov
2024-04-24
llama : add phi3 support (#6852)
liuwei-git
2024-04-16
llama : add qwen2moe (#6074)
Shijie
2024-04-13
model: support arch `DbrxForCausalLM` (#6515)
Pierrick Hymbert
2024-04-09
llama : add Command R Plus support (#6491)
Carolinabanana
2024-04-03
llama : add SEA-LION support (#6448)
bryanSwk
2024-04-03
ggml : mul_mat_id use the same tensor for all the experts (#6387)
slaren
2024-03-23
llama : add grok-1 support (#6204)
Julius Arkenberg
2024-03-08
llama : support Mamba Selective State Space Models (#5328)
compilade
2024-03-01
llama : add StarCoder2 support (#5795)
Sourab Mangrulkar
2024-02-13
llama : add support for Nomic Embed (#5468)
Jared Van Bortel
2024-02-11
Add support for BERT embedding models (#5423)
Douglas Hanley
2024-02-01
llama : support InternLM2 (#5184)
Guoteng
2024-01-19
llama : add CodeShell support (#5016)
chiranko
2024-01-13
convert : update phi-2 to latest HF repo (#4903)
Georgi Gerganov
2024-01-12
llama : fix llm_build_k_shift to use correct n_rot (#4889)
Georgi Gerganov
2023-12-28
gpt2 : Add gpt2 architecture integration (#4555)
manikbhandari
2023-12-27
llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)
Nam D. Tran
2023-12-24
llama : add PLaMo model (#3557)
Shintarou Okada
2023-12-18
llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)
Ebey Abraham
2023-12-13
llama : add Mixtral support (#4406)
slaren
2023-12-01
llama : add Qwen support (#4281)
Shijie
2023-11-11
gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)
Kerfuffle