ik_llama.cpp.git

Age	Commit message	Author
2024-02-09	llava : add requirements.txt and update README.md (#5428)	Daniel Bevenius
2024-02-09	server : fix prompt caching for repeated prompts (#5420)	Riley Stewart
2024-02-08	llava : add missing .py, and fix paths in README.md (#5414)	Daniel Bevenius
2024-02-08	llava: fix typo/formatting in README.md (#5405)	Daniel Bevenius
2024-02-07	llava-cli : always tokenize special tokens (#5382)	Xiao-Yong Jin
2024-02-07	server : update `/props` with "total_slots" value (#5373)	Justin Parker
2024-02-06	server : remove model.json endpoint (#5371)	Alexey Parfenov
2024-02-06	server : include total "num_slots" in props endpoint (#5349)	Justin Parker
2024-02-06	server : add `dynatemp_range` and `dynatemp_exponent` (#5352)	Michael Coppola
2024-02-06	server : various fixes for the prompt field in /completion (#5300)	Niall Coates
2024-02-05	server : allow to get default generation settings for completion (#5307)	Alexey Parfenov
2024-02-04	Adding some imatrix tools (#5302)	Kawrakow
2024-02-03	refactor : switch to emplace_back to avoid extra object (#5291)	Michael Klimenko
2024-02-02	perplexity : fix KL divergence calculations on Windows (#5273)	kalomaze
2024-02-02	[SYCL] update guide of SYCL backend (#5254)	Neo Zhang Jianyu
2024-02-01	add --no-mmap in llama-bench (#5257)	Neo Zhang Jianyu
2024-01-31	llama : remove LLAMA_MAX_DEVICES and LLAMA_SUPPORTS_GPU_OFFLOAD (#5240)	Georgi Gerganov
2024-01-31	llava : add MobileVLM support (#5132)	JidongZhang-THU
2024-01-31	format license text, restore apache license by legal suggestion (#5233)	Neo Zhang Jianyu
2024-01-31	support SYCL backend windows build (#5208)	Neo Zhang Jianyu
2024-01-30	kompute : llama-bench support and ggml_cpu_has_kompute() (#5226)	Jared Van Bortel
2024-01-30	Revert "server : change deps.sh xxd files to string literals (#5221)"	Georgi Gerganov
2024-01-30	server : fix context shift (#5195)	Georgi Gerganov
2024-01-30	server : change deps.sh xxd files to string literals (#5221)	JohnnyB
2024-01-30	SOTA 3-bit quants (#5196)	Kawrakow
2024-01-30	quantize : fix typo (#5211)	Vladimir Malyutin
2024-01-30	main : allow empty --prompt-cache file (#5176)	divinity76
2024-01-30	server : improve README (#5209)	Wu Jian Ping
2024-01-29	server : embeddings compatibility for OpenAI (#5190)	Wu Jian Ping
2024-01-28	ggml : add Vulkan backend (#2059)	0cc4m
2024-01-28	ggml : add unified SYCL backend for Intel GPUs (#2690)	Abhilash Majumder
2024-01-28	docker : add server-first container images (#5157)	Kyle Mistele
2024-01-27	llava : support for Yi-VL and fix for mobileVLM (#5093)	John
2024-01-27	sync : ggml	Georgi Gerganov
2024-01-27	Remove unused data and add fixes (#5154)	Michael Klimenko
2024-01-27	server : add self-extend support (#5104)	Maximilian Winter
2024-01-26	server : refactored the task processing logic (#5065)	Xuan Son Nguyen
2024-01-25	examples : make pydantic scripts pass mypy and support py3.8 (#5099)	Jared Van Bortel
2024-01-25	android : use release cmake build type by default (#5123)	Valentin Konovalov
2024-01-23	Additional KL-divergence statistics (#5081)	Kawrakow
2024-01-23	minor : clean-up some warnings and style (#5094)	Georgi Gerganov
2024-01-23	llama.vim : added api key support (#5090)	Michael Coppola
2024-01-22	KL-divergence (#5076)	Kawrakow
2024-01-22	llava : MobileVLM support (#4954)	XiaotaoChen
2024-01-22	imatrix : keep intermediate imatrix results (#5077)	Kawrakow
2024-01-22	finetune : print sample-start/include-sample-start (#5072)	Daniel Bevenius
2024-01-22	llama : add Q3_K_XS (#5060)	Kawrakow
2024-01-21	Add ability to evaluate multiple choice tasks (#5047)	Kawrakow
2024-01-21	Slightly faster imatrix (#5050)	Kawrakow
2024-01-20	perplexity : fix MSVC build after #5020 (#5043)	Jared Van Bortel