ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2024-04-29	llama : fix BPE pre-tokenization (#6920)	Georgi Gerganov
2024-04-24	llama : add phi3 support (#6852)	liuwei-git
2024-04-21	gguf-py : add IQ1_M to GGML_QUANT_SIZES (#6761)	pmysl
2024-04-19	Implement the OLMo architecture (#6741)	nopperl
2024-04-18	convert : support models with multiple chat templates (#6588)	Sigbjørn Skjæret
2024-04-16	llama : add StableLM2 12B (#6635)	Ashish
2024-04-16	llama : add qwen2moe (#6074)	Shijie
2024-04-16	gguf : add special tokens metadata for FIM/Infill (#6689)	Daniel Bevenius
2024-04-13	model: support arch `DbrxForCausalLM` (#6515)	Pierrick Hymbert
2024-04-09	llama : add Command R Plus support (#6491)	Carolinabanana
2024-04-05	gguf.py : add licence and version to gguf writer (#6504)	Brian
2024-04-03	llama : add SEA-LION support (#6448)	bryanSwk
2024-04-03	ggml : mul_mat_id use the same tensor for all the experts (#6387)	slaren
2024-03-29	[Model] Add support for xverse (#6301)	hxer7963
2024-03-26	IQ1_M: 1.75 bpw quantization (#6302)	Kawrakow
2024-03-23	llama : add grok-1 support (#6204)	Julius Arkenberg
2024-03-15	llama : add Command-R support (#6033)	Andrew Canis
2024-03-15	gguf : add support for I64 and F64 arrays (#6062)	Ondřej Čertík
2024-03-14	llama : support models without vocabulary (#5798)	Michael Podvitskiy
2024-03-14	gguf-py : add support for I8, I16 and I32 (#6045)	Ondřej Čertík
2024-03-08	llama : support Mamba Selective State Space Models (#5328)	compilade
2024-03-03	gguf-dump : support i-quants (#5841)	Nindaleth
2024-03-01	llama : add StarCoder2 support (#5795)	Sourab Mangrulkar
2024-02-21	llama : add `gemma` model (#5631)	postmasters
2024-02-15	Use correct type of pooling for embedding models (#5500)	Douglas Hanley
2024-02-15	fix(gguf-py): special tokens are no longer skipped when add_<token>_token is ...	Michaël de Vries
2024-02-13	llama : add support for Nomic Embed (#5468)	Jared Van Bortel
2024-02-13	llama : support batched embeddings (#5466)	Douglas Hanley
2024-02-11	Add support for BERT embedding models (#5423)	Douglas Hanley
2024-02-07	llama : add MiniCPM support (#5346)	runfuture
2024-02-01	llama : support InternLM2 (#5184)	Guoteng
2024-01-28	llama : add support for Orion-14B (#5118)	sharpHL
2024-01-19	llama : support upcoming Qwen2 (#5037)	Shijie
2024-01-19	llama : add CodeShell support (#5016)	chiranko
2024-01-13	convert : update phi-2 to latest HF repo (#4903)	Georgi Gerganov
2024-01-02	llama : differentiate the KV dims in the attention (#4657)	postmasters
2023-12-28	gpt2 : Add gpt2 architecture integration (#4555)	manikbhandari
2023-12-27	llama : add AWQ for llama, llama2, mpt, and mistral models (#4593)	Nam D. Tran
2023-12-24	llama : add PLaMo model (#3557)	Shintarou Okada
2023-12-18	llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)	Ebey Abraham
2023-12-13	llama : add Mixtral support (#4406)	slaren
2023-12-01	llama : add Qwen support (#4281)	Shijie
2023-11-19	gguf-py : export chat templates (#4125)	slaren
2023-11-14	stablelm : StableLM support (#3586)	Galunid
2023-11-11	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	Kerfuffle