index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
ggml-opencl.cpp
Age
Commit message (
Expand
)
Author
2023-10-20
CLBlast: Add outer loops over src0 for broadcasting in mulmat
shibe2
2023-10-18
opencl : fix element-wise multiplication (#3656)
shibe2
2023-10-17
CLBlast: Fix temporary buffer size for f16 conversion (wsize)
shibe2
2023-10-12
CLBlast: Fix matrix-vector multiplication (#3544)
shibe2
2023-10-05
CLBlast: Fix handling of on-device tensor data
shibe2
2023-10-02
CLBlast: Add broadcast support for matrix multiplication (#3402)
shibe2
2023-09-21
ggml-opencl.cpp: Make private functions static (#3300)
shibe2
2023-09-04
ggml-opencl : store GPU buffer in ggml_tensor::extra (#2994)
slaren
2023-09-03
opencl : fix a bug in ggml_cl_pool_malloc() for ggml_cl_mul_mat_f32() (#2955)
Wentai Zhang
2023-07-07
Fix opencl by wrap #if-else-endif with \n (#2086)
Howard Su
2023-07-04
[ggml] fix index for ne03 value in ggml_cl_mul_f32 (#2088)
Govlzkoy
2023-06-29
Porting the improved K-Quant CUDA kernels to OpenCL (#1966)
LostRuins
2023-06-17
ggml : fix warnings under MSVC (#1908)
Howard Su
2023-06-16
opencl : support k-quants (#1836)
0cc4m
2023-06-12
Leverage mmap for offloading tensors to GPU (#1597)
Howard Su
2023-06-09
OpenCL: Add release memory (#1741)
Robert Sung-wook Shin
2023-06-06
Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)
Johannes Gäßler
2023-06-06
Clblast fixes + enhancements to save VRAM and offload more layers (#1675)
LostRuins
2023-06-04
OpenCL: Fix duplication of layers in VRAM and RAM, add GPU mul kernel (#1653)
0cc4m
2023-05-28
opencl : no need to allocate cl_mem on heap (#1612)
Howard Su
2023-05-28
opencl : use strstr to check if fp16 supported (#1611)
Howard Su
2023-05-23
Fix handling of "invalid property" when creating OpenCL command queue (#1565)
Maarten ter Huurne
2023-05-23
OpenCL Token Generation Acceleration (#1459)
0cc4m