ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2024-05-20	[SYCL] Update SYCL upscale operation (#7321)	AidanBeltonS
2024-05-15	Add missing " (#7303)	AidanBeltonS
2024-05-15	ggml : add `ggml_upscale_ext` (ggml/814)	John Balis
2024-05-13	[SYCL] rm wait() (#7233)	Neo Zhang
2024-05-11	ggml : full ALiBi support (#7192)	Georgi Gerganov
2024-05-10	Minor arithmetic improvement to mmvq wrapper kernel (#7172)	Ouadie EL FAROUKI
2024-04-30	ggml : add Flash Attention (#5021)	Georgi Gerganov
2024-04-28	add device version in device list (#6959)	Neo Zhang
2024-04-18	ggml : group all experts in a single ggml_mul_mat_id (#6505)	slaren
2024-04-15	fix mul_mat_id() for new input, make the ut pass (#6682)	Neo Zhang Jianyu
2024-04-14	fix memcpy() crash, add missed cmd in guide, fix softmax (#6622)	Neo Zhang Jianyu
2024-04-08	remove row=1 cond (#6532)	Abhilash Majumder
2024-04-07	support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_T...	Neo Zhang Jianyu
2024-04-05	[SYCL] Fixed minor bug when enabling FP16 for non intel targets (#6464)	Ouadie EL FAROUKI
2024-04-03	[SYCL] Disable iqx on windows as WA (#6435)	Meng, Hengyu
2024-03-28	[SYCL] fix set main gpu crash (#6339)	Neo Zhang Jianyu
2024-03-27	[SYCL] Fix batched impl for NVidia GPU (#6164)	AidanBeltonS
2024-03-26	llama : greatly reduce output buffer memory usage (#6122)	compilade
2024-03-24	[SYCL] offload op (#6217)	Meng, Hengyu
2024-03-21	Add nvidia and amd backends (#6157)	AidanBeltonS
2024-03-18	backend : offload large batches to GPU (#6083)	slaren
2024-03-15	fix set main gpu error (#6073)	Neo Zhang Jianyu
2024-03-15	[SYCL] Fix non-intel device selection (#6042)	AidanBeltonS
2024-03-13	llama : add pipeline parallelism support (#6017)	slaren
2024-03-13	Update get version (#6025)	AidanBeltonS
2024-03-12	ggml : reuse quantum structs across backends (#5943)	Georgi Gerganov
2024-03-12	sycl : update IQ1_S kernels (WIP - not working!) (#5995)	Georgi Gerganov
2024-03-11	[SYCL] Add q3_s and q1_s (#5886)	Abhilash Majumder
2024-03-09	ggml : add ggml-common.h to deduplicate shared code (#5940)	Georgi Gerganov
2024-03-07	Revert "[SYCL] fix error when set main gpu to non-zero (#5901)" (#5918)	Neo Zhang Jianyu
2024-03-07	[SYCL] fix error when set main gpu to non-zero (#5901)	Neo Zhang Jianyu
2024-03-06	add wait() to make code stable (#5895)	Neo Zhang Jianyu
2024-03-05	[SYCL] fix mul_mat fault in CI/unit-test (#5862)	Neo Zhang Jianyu
2024-03-04	ggml : introduce ggml_status (ggml/750)	Michael Podvitskiy
2024-03-02	Support multiple GPUs (split mode) on SYCL backend (#5806)	Neo Zhang Jianyu
2024-03-01	[SYCL] Use batched mul_mat pathway (#5591)	AidanBeltonS
2024-02-28	Introduce backend GUIDs (ggml/743)	UEXTM.com
2024-02-26	[SYCL] Add support for soft_max ALiBi (#5639)	AidanBeltonS
2024-02-25	code : normalize enum names (#5697)	Georgi Gerganov
2024-02-21	[SYCL] conext add name (#5624)	Meng, Hengyu
2024-02-20	Update ggml_sycl_op_mul_mat_vec_q (#5502)	AidanBeltonS
2024-02-12	ggml-sycl: Replace 3d ops with macro (#5458)	Abhilash Majumder
2024-02-08	Fix f16_sycl cpy call from Arc (#5411)	Abhilash Majumder
2024-02-05	[SYCL] Fix cpy with dims of 3 (#5289)	AidanBeltonS
2024-02-03	Fix im2col with 32fp (#5286)	AidanBeltonS
2024-02-02	Tidy ggml-sycl (#5261)	AidanBeltonS
2024-02-02	[SYCL] get MAX_MEM_ALLOC from device property (#5270)	Meng, Hengyu
2024-02-01	add --no-mmap in llama-bench (#5257)	Neo Zhang Jianyu
2024-01-31	format license text, restore apache license by legal suggestion (#5233)	Neo Zhang Jianyu
2024-01-28	ggml : add Vulkan backend (#2059)	0cc4m