index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
ggml
/
src
Age
Commit message (
Expand
)
Author
2024-08-01
iq3_k: CUDA dot product
Iwan Kawrakow
2024-08-01
iq3_k: Basics
Iwan Kawrakow
2024-08-01
iq2_k: very slightly better CUDA dot product
Iwan Kawrakow
2024-08-01
iq2_k: better CUDA dot product
Iwan Kawrakow
2024-08-01
iq2_k: CUDA dot product finally works
Iwan Kawrakow
2024-08-01
iq5_k: CUDA dot product finally works
Iwan Kawrakow
2024-08-01
Factor out iqk CUDA dot products
Iwan Kawrakow
2024-08-01
iq5_k: CUDA dot product still not working
Iwan Kawrakow
2024-08-01
iq5_k: Metal
Iwan Kawrakow
2024-08-01
iq5_k: NEON
Iwan Kawrakow
2024-08-01
iq5_k: AVX512
Iwan Kawrakow
2024-08-01
iq5_k: AVX2
Iwan Kawrakow
2024-08-01
iq5_k: Basics
Iwan Kawrakow
2024-08-01
iq2_k: Metal. Dot product is wrong
Iwan Kawrakow
2024-08-01
iq2_k: NEON
Iwan Kawrakow
2024-08-01
iq2_k: slightly faster AVX512
Iwan Kawrakow
2024-08-01
iq2_k: simplify AVX512
Iwan Kawrakow
2024-08-01
iq2_k: AVX2
Iwan Kawrakow
2024-08-01
iq2_k: Basics
Iwan Kawrakow
2024-07-28
IQ4_K: SOTA 4-bit quantization (#6)
Kawrakow
2024-07-27
Simdify and multi-thread tanh (#4)
Kawrakow
2024-07-27
Merge mainline llama.cpp (#3)
Kawrakow