summaryrefslogtreecommitdiff
path: root/ggml/src/iqk/iqk_quantize.cpp
diff options
context:
space:
mode:
authorKawrakow <iwankawrakow@gmail.com>2024-10-22 17:28:14 +0200
committerGitHub <noreply@github.com>2024-10-22 17:28:14 +0200
commitb61cf7d0d7e7c5d971087d2f919818fbf684809e (patch)
tree13b094803488737aed3dc9817e96c196f7bcbf9a /ggml/src/iqk/iqk_quantize.cpp
parent462c6cd7b1b03843ab782e36c75da9bfea657c14 (diff)
Add support for Granite and GraniteMoE models (#102)
* Add Granite and GranoteMoE models * Granite: avoid NaNs on CUDA by scaling Q before K*Q multiplication --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'ggml/src/iqk/iqk_quantize.cpp')
0 files changed, 0 insertions, 0 deletions