summaryrefslogtreecommitdiff
path: root/examples/server
diff options
context:
space:
mode:
authorslaren <slarengh@gmail.com>2023-09-30 18:12:57 +0200
committerGitHub <noreply@github.com>2023-09-30 18:12:57 +0200
commitf5ef5cfb18148131fcf45bdd2331f0db5ab7c3d0 (patch)
tree97465215d07603cfca34daf8adf8280078e0bf5e /examples/server
parent40e07a60f9ce06e79f3ccd4c903eba300fb31b5e (diff)
ggml-cuda : perform cublas mat mul of quantized types as f16 (#3412)
* ggml-cuda : perform cublas matrix multiplication of quantized types as fp16 * rename CC_TURING to CC_VOLTA * disable fp16 mat mul completely with multi GPU
Diffstat (limited to 'examples/server')
0 files changed, 0 insertions, 0 deletions