path: root/ggml-sycl.cpp
Age  Commit message  Author
2024-05-20  [SYCL] Update SYCL upscale operation (#7321)  AidanBeltonS
2024-05-15  Add missing " (#7303)  AidanBeltonS
2024-05-15  ggml : add `ggml_upscale_ext` (ggml/814)  John Balis
2024-05-13  [SYCL] rm wait() (#7233)  Neo Zhang
2024-05-11  ggml : full ALiBi support (#7192)  Georgi Gerganov
2024-05-10  Minor arithmetic improvement to mmvq wrapper kernel (#7172)  Ouadie EL FAROUKI
2024-04-30  ggml : add Flash Attention (#5021)  Georgi Gerganov
2024-04-28  add device version in device list (#6959)  Neo Zhang
2024-04-18  ggml : group all experts in a single ggml_mul_mat_id (#6505)  slaren
2024-04-15  fix mul_mat_id() for new input, make the ut pass (#6682)  Neo Zhang Jianyu
2024-04-14  fix memcpy() crash, add missed cmd in guide, fix softmax (#6622)  Neo Zhang Jianyu
2024-04-08  remove row=1 cond (#6532)  Abhilash Majumder
2024-04-07  support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS, GGML_TYPE_IQ3_XXS, GGML_T...  Neo Zhang Jianyu
2024-04-05  [SYCL] Fixed minor bug when enabling FP16 for non intel targets (#6464)  Ouadie EL FAROUKI
2024-04-03  [SYCL] Disable iqx on windows as WA (#6435)  Meng, Hengyu
2024-03-28  [SYCL] fix set main gpu crash (#6339)  Neo Zhang Jianyu
2024-03-27  [SYCL] Fix batched impl for NVidia GPU (#6164)  AidanBeltonS
2024-03-26  llama : greatly reduce output buffer memory usage (#6122)  compilade
2024-03-24  [SYCL] offload op (#6217)  Meng, Hengyu
2024-03-21  Add nvidia and amd backends (#6157)  AidanBeltonS
2024-03-18  backend : offload large batches to GPU (#6083)  slaren
2024-03-15  fix set main gpu error (#6073)  Neo Zhang Jianyu
2024-03-15  [SYCL] Fix non-intel device selection (#6042)  AidanBeltonS
2024-03-13  llama : add pipeline parallelism support (#6017)  slaren
2024-03-13  Update get version (#6025)  AidanBeltonS
2024-03-12  ggml : reuse quantum structs across backends (#5943)  Georgi Gerganov
2024-03-12  sycl : update IQ1_S kernels (WIP - not working!) (#5995)  Georgi Gerganov
2024-03-11  [SYCL] Add q3_s and q1_s (#5886)  Abhilash Majumder
2024-03-09  ggml : add ggml-common.h to deduplicate shared code (#5940)  Georgi Gerganov
2024-03-07  Revert "[SYCL] fix error when set main gpu to non-zero (#5901)" (#5918)  Neo Zhang Jianyu
2024-03-07  [SYCL] fix error when set main gpu to non-zero (#5901)  Neo Zhang Jianyu
2024-03-06  add wait() to make code stable (#5895)  Neo Zhang Jianyu
2024-03-05  [SYCL] fix mul_mat fault in CI/unit-test (#5862)  Neo Zhang Jianyu
2024-03-04  ggml : introduce ggml_status (ggml/750)  Michael Podvitskiy
2024-03-02  Support multiple GPUs (split mode) on SYCL backend (#5806)  Neo Zhang Jianyu
2024-03-01  [SYCL] Use batched mul_mat pathway (#5591)  AidanBeltonS
2024-02-28  Introduce backend GUIDs (ggml/743)  UEXTM.com
2024-02-26  [SYCL] Add support for soft_max ALiBi (#5639)  AidanBeltonS
2024-02-25  code : normalize enum names (#5697)  Georgi Gerganov
2024-02-21  [SYCL] conext add name (#5624)  Meng, Hengyu
2024-02-20  Update ggml_sycl_op_mul_mat_vec_q (#5502)  AidanBeltonS
2024-02-12  ggml-sycl: Replace 3d ops with macro (#5458)  Abhilash Majumder
2024-02-08  Fix f16_sycl cpy call from Arc (#5411)  Abhilash Majumder
2024-02-05  [SYCL] Fix cpy with dims of 3 (#5289)  AidanBeltonS
2024-02-03  Fix im2col with 32fp (#5286)  AidanBeltonS
2024-02-02  Tidy ggml-sycl (#5261)  AidanBeltonS
2024-02-02  [SYCL] get MAX_MEM_ALLOC from device property (#5270)  Meng, Hengyu
2024-02-01  add --no-mmap in llama-bench (#5257)  Neo Zhang Jianyu
2024-01-31  format license text, restore apache license by legal suggestion (#5233)  Neo Zhang Jianyu
2024-01-28  ggml : add Vulkan backend (#2059)  0cc4m