summaryrefslogtreecommitdiff
path: root/ggml-opencl.cpp
AgeCommit message (Expand)Author
2023-10-20CLBlast: Add outer loops over src0 for broadcasting in mulmatshibe2
2023-10-18opencl : fix element-wise multiplication (#3656)shibe2
2023-10-17CLBlast: Fix temporary buffer size for f16 conversion (wsize)shibe2
2023-10-12CLBlast: Fix matrix-vector multiplication (#3544)shibe2
2023-10-05CLBlast: Fix handling of on-device tensor datashibe2
2023-10-02CLBlast: Add broadcast support for matrix multiplication (#3402)shibe2
2023-09-21ggml-opencl.cpp: Make private functions static (#3300)shibe2
2023-09-04ggml-opencl : store GPU buffer in ggml_tensor::extra (#2994)slaren
2023-09-03opencl : fix a bug in ggml_cl_pool_malloc() for ggml_cl_mul_mat_f32() (#2955)Wentai Zhang
2023-07-07Fix opencl by wrap #if-else-endif with \n (#2086)Howard Su
2023-07-04[ggml] fix index for ne03 value in ggml_cl_mul_f32 (#2088)Govlzkoy
2023-06-29Porting the improved K-Quant CUDA kernels to OpenCL (#1966)LostRuins
2023-06-17ggml : fix warnings under MSVC (#1908)Howard Su
2023-06-16opencl : support k-quants (#1836)0cc4m
2023-06-12Leverage mmap for offloading tensors to GPU (#1597)Howard Su
2023-06-09OpenCL: Add release memory (#1741)Robert Sung-wook Shin
2023-06-06Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)Johannes Gäßler
2023-06-06Clblast fixes + enhancements to save VRAM and offload more layers (#1675)LostRuins
2023-06-04OpenCL: Fix duplication of layers in VRAM and RAM, add GPU mul kernel (#1653)0cc4m
2023-05-28opencl : no need to allocate cl_mem on heap (#1612)Howard Su
2023-05-28opencl : use strstr to check if fp16 supported (#1611)Howard Su
2023-05-23Fix handling of "invalid property" when creating OpenCL command queue (#1565)Maarten ter Huurne
2023-05-23OpenCL Token Generation Acceleration (#1459)0cc4m