summaryrefslogtreecommitdiff
path: root/common/json-schema-to-grammar.cpp
diff options
context:
space:
mode:
authorKawrakow <48489457+ikawrakow@users.noreply.github.com>2024-03-21 08:27:57 +0100
committerGitHub <noreply@github.com>2024-03-21 08:27:57 +0100
commit76aa30a26353f597e4fbe3cf776772ae812af89a (patch)
tree35654d27aa0f3fd656aa5cab1125999c13ae5201 /common/json-schema-to-grammar.cpp
parentc5b8595e3f4f4ed319ef71c9c9d868d1b7a27626 (diff)
Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized K cache (#6183)
* k_cache: be able to use Q5_0 * k_cache: be able to use Q5_1 on CODA * k_cache: be able to use Q5_0 on Metal * k_cache: be able to use Q5_1 on Metal * k_cache: be able to use IQ4_NL - just CUDA for now * k_cache: be able to use IQ4_NL on Metal * k_cache: add newly added supported types to llama-bench and CUDA supports_op --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Diffstat (limited to 'common/json-schema-to-grammar.cpp')
0 files changed, 0 insertions, 0 deletions