ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2023-11-13	ggml : sync (im2col, GPU conv, 32-bit arm compat) (#4060)	Georgi Gerganov
2023-11-13	readme : update hot topics	Georgi Gerganov
2023-11-13	sync : ggml (backend v2) (#3912)	Georgi Gerganov
2023-11-13	Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)	Kerfuffle
2023-11-12	gguf-py: gguf_writer: Use bytearray to build metadata (#4051)	Kerfuffle
2023-11-11	Fix some documentation typos/grammar mistakes (#4032)	Richard Kiss
2023-11-11	Fix gguf-convert-endian script (#4037)	M. Yusuf Sarıgöz
2023-11-10	server : fix crash when prompt exceeds context size (#3996)	Alexey Parfenov
2023-11-11	gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981)	Kerfuffle
2023-11-10	server : allow continue edit on completion mode (#3950)	Jhen-Jie Hong
2023-11-10	Unbreak persimmon after #3837 (#4010)	Galunid
2023-11-09	scripts: Generalize convert scripts (#3838)	Galunid
2023-11-08	server : add min_p param (#3877)	Mihai
2023-11-08	ggml-alloc : fix backend assignments of views (#3982)	slaren
2023-11-07	gguf : track writer state, free unneeded tensors, cleanup (#3871)	Jared Van Bortel
2023-11-07	make : do not add linker flags when compiling static llava lib (#3977)	Georgi Gerganov
2023-11-07	ggml : fix backward rope after YaRN (#3974)	xaedes
2023-11-07	Use params when loading models in llava-cli (#3976)	Matthew Tejo
2023-11-07	cuda : supports running on CPU for GGML_USE_CUBLAS=ON build (#3946)	Meng Zhang
2023-11-07	llava : expose as a shared library for downstream projects (#3613)	Damian Stewart
2023-11-05	ggml-cuda : fix f16 mul mat (#3961)	slaren
2023-11-05	Allow common process_escapes to handle \x sequences (#3928)	Kerfuffle
2023-11-05	server : fix typo for --alias shortcut from -m to -a (#3958)	Thái Hoàng Tâm
2023-11-05	cuda : fix disabling device with --tensor-split 1,0 (#3951)	Jared Van Bortel
2023-11-05	llama : mark LLM_ARCH_STARCODER as full offload supported (#3945)	Meng Zhang
2023-11-05	cmake : MSVC instruction detection (fixed up #809) (#3923)	Eve
2023-11-05	ci : use intel sde when ci cpu doesn't support avx512 (#3949)	Eve
2023-11-05	cuda : revert CUDA pool stuff (#3944)	slaren
2023-11-04	gguf-py: Support 01.AI Yi models (#3943)	Kerfuffle
2023-11-03	metal : round up to 16 to fix MTLDebugComputeCommandEncoder assertion (#3938)	Peter Sugihara
2023-11-03	ggml-metal: fix yarn rope (#3937)	Xiao-Yong Jin
2023-11-03	ggml-cuda : move row numbers to x grid dim in mmv kernels (#3921)	slaren
2023-11-03	speculative : change default p_accept to 0.5 + CLI args (#3919)	Georgi Gerganov
2023-11-03	common : YAYF (yet another YARN fix) (#3925)	Georgi Gerganov
2023-11-03	llama : change yarn_ext_factor placeholder to -1 (#3922)	cebtenzzre
2023-11-02	cuda : add ROCM aliases for CUDA pool stuff (#3918)	Kerfuffle
2023-11-02	cmake : fix relative path to git submodule index (#3915)	Andrei
2023-11-02	readme : add notice about #3912	Georgi Gerganov
2023-11-02	cuda : fix const ptrs warning causing ROCm build issues (#3913)	Georgi Gerganov
2023-11-02	cuda : use CUDA memory pool with async memory allocation/deallocation when av...	Oleksii Maryshchenko
2023-11-02	gguf : print error for GGUFv1 files (#3908)	Georgi Gerganov
2023-11-02	cmake : disable LLAMA_NATIVE by default (#3906)	slaren
2023-11-02	gguf : remove special-case code for GGUFv1 (#3901)	Georgi Gerganov
2023-11-02	llm : prevent from 1-D tensors being GPU split (#3697)	Georgi Gerganov
2023-11-02	build : link against build info instead of compiling against it (#3879)	cebtenzzre
2023-11-02	cuda : check if this fixes Pascal card regression (#3882)	Georgi Gerganov
2023-11-02	metal : fix build errors and kernel sig after #2268 (#3898)	Georgi Gerganov
2023-11-02	cuda : fix RoPE after #2268 (#3897)	cebtenzzre
2023-11-01	llama : fix llama_context_default_params after #2268 (#3893)	cebtenzzre
2023-11-01	ggml-cuda : compute ptrs for cublasGemmBatchedEx in a kernel (#3891)	slaren