ik_llama.cpp.git

Age	Commit message	Author
2024-03-29	Vulkan k-quant mmq and ggml-backend offload functionality (#6155)	0cc4m
2024-03-29	sync : ggml (#6351)	Georgi Gerganov
2024-03-29	[Model] Add support for xverse (#6301)	hxer7963
2024-03-29	ci : fix BGE wget (#6383)	Georgi Gerganov
2024-03-29	readme : add project (#6356)	zhouwg
2024-03-29	cmake : add explicit metal version options (#6370)	Matt Clayton
2024-03-29	llama : remove redundant reshape in build_kv_store (#6369)	Daniel Bevenius
2024-03-29	convert : allow conversion of Mistral HF models (#6144)	Pedro Cuenca
2024-03-28	readme : add notice for UI list	Georgi Gerganov
2024-03-28	[SYCL] Revisited & updated SYCL build documentation (#6141)	Ouadie EL FAROUKI
2024-03-28	convert : refactor vocab selection logic (#6355)	Jared Van Bortel
2024-03-28	llava : fix MobileVLM (#6364)	Ziang Wu
2024-03-28	llama : fix command-r inference when omitting outputs (#6367)	compilade
2024-03-28	ci: bench: fix master not schedule, fix commit status failed on external repo...	Pierrick Hymbert
2024-03-28	doc: fix outdated default value of batch size (#6336)	Ting Sun
2024-03-28	server : stop gracefully on SIGTERM (#6348)	Eric Zhang
2024-03-28	nix: removed unnessesary indentation	hutli
2024-03-28	nix: moved blas availability check to package inputs so it is still overridable	hutli
2024-03-28	using blas.meta.available to check host platform	hutli
2024-03-28	only using explicit blas if hostPlatform is allowed	hutli
2024-03-28	nix: .#windows: proper cross-compilation set-up	Someone Serge
2024-03-28	nix: package: don't introduce the dependency on python	Someone Serge
2024-03-28	nix: .#widnows: init	hutli
2024-03-28	doc: fix typo in MobileVLM-README.md (#6181)	Ziang Wu
2024-03-28	[SYCL] fix set main gpu crash (#6339)	Neo Zhang Jianyu
2024-03-27	server: continuous performance monitoring and PR comment (#6283)	Pierrick Hymbert
2024-03-27	nix: ci: dont test cuda and rocm (for now)	Someone Serge
2024-03-27	ggml : fix bounds checking of zero size views (#6347)	slaren
2024-03-27	make : whitespace	Georgi Gerganov
2024-03-27	embedding : show full embedding for single prompt (#6342)	howlger
2024-03-27	[SYCL] Fix batched impl for NVidia GPU (#6164)	AidanBeltonS
2024-03-27	Make IQ1_M work for QK_K = 64 (#6327)	Kawrakow
2024-03-27	common : change --no-penalize-nl to --penalize-nl (#6334)	Sigbjørn Skjæret
2024-03-27	llama2c : open file as binary (#6332)	Georgi Gerganov
2024-03-27	readme : add php api bindings (#6326)	Mateusz Charytoniuk
2024-03-27	server: public: use relative routes for static files (#6325)	Eric Zhang
2024-03-27	[SYCL] fix no file in win rel (#6314)	Neo Zhang Jianyu
2024-03-26	wpm : portable unicode tolower (#6305)	Jared Van Bortel
2024-03-26	llama : greatly reduce output buffer memory usage (#6122)	compilade
2024-03-26	IQ1_M: 1.75 bpw quantization (#6302)	Kawrakow
2024-03-26	convert-hf : fix exception in sentencepiece with added tokens (#6320)	Pedro Cuenca
2024-03-26	quantize : be able to override metadata by key (#6321)	Kawrakow
2024-03-26	embedding : adjust `n_ubatch` value (#6296)	Minsoo Cheong
2024-03-26	server : add `n_discard` parameter (#6300)	Jan Boon
2024-03-25	nix: make `xcrun` visible in Nix sandbox for precompiling Metal shaders (#6118)	Joseph Stahl
2024-03-26	cuda : rename build flag to LLAMA_CUDA (#6299)	slaren
2024-03-25	nix: fix blas support (#6281)	Christian Kögler
2024-03-25	tests : include IQ2_XXS and IQ2_XS in test-quantize-fns (#6303)	Kawrakow
2024-03-25	flake.lock: Update (#6266)	Georgi Gerganov
2024-03-25	cuda : fix LLAMA_CUDA_F16 build (#6298)	slaren