index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
ggml-sycl.cpp
Age
Commit message (
Expand
)
Author
2024-03-28
[SYCL] fix set main gpu crash (#6339)
Neo Zhang Jianyu
2024-03-27
[SYCL] Fix batched impl for NVidia GPU (#6164)
AidanBeltonS
2024-03-26
llama : greatly reduce output buffer memory usage (#6122)
compilade
2024-03-24
[SYCL] offload op (#6217)
Meng, Hengyu
2024-03-21
Add nvidia and amd backends (#6157)
AidanBeltonS
2024-03-18
backend : offload large batches to GPU (#6083)
slaren
2024-03-15
fix set main gpu error (#6073)
Neo Zhang Jianyu
2024-03-15
[SYCL] Fix non-intel device selection (#6042)
AidanBeltonS
2024-03-13
llama : add pipeline parallelism support (#6017)
slaren
2024-03-13
Update get version (#6025)
AidanBeltonS
2024-03-12
ggml : reuse quantum structs across backends (#5943)
Georgi Gerganov
2024-03-12
sycl : update IQ1_S kernels (WIP - not working!) (#5995)
Georgi Gerganov
2024-03-11
[SYCL] Add q3_s and q1_s (#5886)
Abhilash Majumder
2024-03-09
ggml : add ggml-common.h to deduplicate shared code (#5940)
Georgi Gerganov
2024-03-07
Revert "[SYCL] fix error when set main gpu to non-zero (#5901)" (#5918)
Neo Zhang Jianyu
2024-03-07
[SYCL] fix error when set main gpu to non-zero (#5901)
Neo Zhang Jianyu
2024-03-06
add wait() to make code stable (#5895)
Neo Zhang Jianyu
2024-03-05
[SYCL] fix mul_mat fault in CI/unit-test (#5862)
Neo Zhang Jianyu
2024-03-04
ggml : introduce ggml_status (ggml/750)
Michael Podvitskiy
2024-03-02
Support multiple GPUs (split mode) on SYCL backend (#5806)
Neo Zhang Jianyu
2024-03-01
[SYCL] Use batched mul_mat pathway (#5591)
AidanBeltonS
2024-02-28
Introduce backend GUIDs (ggml/743)
UEXTM.com
2024-02-26
[SYCL] Add support for soft_max ALiBi (#5639)
AidanBeltonS
2024-02-25
code : normalize enum names (#5697)
Georgi Gerganov
2024-02-21
[SYCL] conext add name (#5624)
Meng, Hengyu
2024-02-20
Update ggml_sycl_op_mul_mat_vec_q (#5502)
AidanBeltonS
2024-02-12
ggml-sycl: Replace 3d ops with macro (#5458)
Abhilash Majumder
2024-02-08
Fix f16_sycl cpy call from Arc (#5411)
Abhilash Majumder
2024-02-05
[SYCL] Fix cpy with dims of 3 (#5289)
AidanBeltonS
2024-02-03
Fix im2col with 32fp (#5286)
AidanBeltonS
2024-02-02
Tidy ggml-sycl (#5261)
AidanBeltonS
2024-02-02
[SYCL] get MAX_MEM_ALLOC from device property (#5270)
Meng, Hengyu
2024-02-01
add --no-mmap in llama-bench (#5257)
Neo Zhang Jianyu
2024-01-31
format license text, restore apache license by legal suggestion (#5233)
Neo Zhang Jianyu
2024-01-28
ggml : add Vulkan backend (#2059)
0cc4m
2024-01-28
ggml : add unified SYCL backend for Intel GPUs (#2690)
Abhilash Majumder