path: root/examples/imatrix/imatrix.cpp
author: Kawrakow <iwankawrakow@gmail.com> 2025-02-11 14:46:30 +0200
committer: GitHub <noreply@github.com> 2025-02-11 14:46:30 +0200
commit: 3c98bfb33d149a0d9d3bb91604dd12709721e3cf
tree: 6a1e5fc373032bb18a62ec3616625eedf1a9f1f3 /examples/imatrix/imatrix.cpp
parent: a366a3d17d8f2de0eb8c3d9eddc7b5840fb5761a
DeepSeek FA support (CPU only) (#200)
* Adding support for K head size != V head size

  This is relevant for DeepSeek models.
  At this point ggml CPU FA works.
  Now I need to go and change iqk FA to make it work with Dk != Dv.

* iqk support for K head size != V head size

  To not have compilation time explode, just Dk = 192, Dv = 128 for now (DeepSeek)

* FA: very slightly faster for nq = 1 (TG)

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
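For readers unfamiliar with what "K head size != V head size" means, the following is a minimal, non-optimized sketch of single-query attention in which the key rows have length Dk and the value rows have length Dv. It is not the ggml or iqk flash-attention code from this commit. The function name `attend_one_query` and the row-major layout are assumptions made purely for illustration. The point it shows is that the scores (and the 1/sqrt(Dk) scale) depend only on Dk, while the output length is Dv, so the two sizes need not match.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of ggml/iqk): attention for one query and one head.
// q: Dk floats. K: n_kv rows of Dk floats. V: n_kv rows of Dv floats (row-major).
// Returns Dv floats.
std::vector<float> attend_one_query(const std::vector<float>& q,
                                    const std::vector<float>& K,
                                    const std::vector<float>& V,
                                    std::size_t n_kv, std::size_t Dk, std::size_t Dv) {
    assert(q.size() == Dk && K.size() == n_kv * Dk && V.size() == n_kv * Dv);
    const float scale = 1.0f / std::sqrt(static_cast<float>(Dk));

    // Scores use only the K head size: s[i] = scale * dot(q, K[i]).
    std::vector<float> s(n_kv);
    float max_s = -INFINITY;
    for (std::size_t i = 0; i < n_kv; ++i) {
        float dot = 0.0f;
        for (std::size_t j = 0; j < Dk; ++j) dot += q[j] * K[i * Dk + j];
        s[i] = dot * scale;
        max_s = std::max(max_s, s[i]);
    }

    // Numerically stable softmax over the scores.
    float sum = 0.0f;
    for (std::size_t i = 0; i < n_kv; ++i) {
        s[i] = std::exp(s[i] - max_s);
        sum += s[i];
    }

    // The weighted sum of V rows uses only the V head size, so the output has Dv entries.
    std::vector<float> out(Dv, 0.0f);
    for (std::size_t i = 0; i < n_kv; ++i) {
        const float w = s[i] / sum;
        for (std::size_t j = 0; j < Dv; ++j) out[j] += w * V[i * Dv + j];
    }
    return out;
}
```

With Dk = 192 and Dv = 128, the sizes named in the commit message, the query and each K row have 192 entries while the result has 128.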
Diffstat (limited to 'examples/imatrix/imatrix.cpp')
0 files changed, 0 insertions, 0 deletions