path: root/examples/llama.swiftui/llama.cpp.swift/LibLlama.swift
author: Ebey Abraham <ebey97@gmail.com> 2023-12-18 17:27:47 +0000
committer: GitHub <noreply@github.com> 2023-12-18 19:27:47 +0200
commit: b9e74f9bca5fdf7d0a22ed25e7a9626335fdfa48
tree: b150a0d4490627bfc9cdd758d08d026fc70b0882 /examples/llama.swiftui/llama.cpp.swift/LibLlama.swift
parent: 3c04bf6da89eaf4c7d317e0518f0687dfcbf2de7
llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490)
* phi2 implementation
* fix breaking change
* phi-2 : various fixes
* phi-2 : use layer norm eps
* py : whitespaces
* llama : fix meta KV override bug
* convert : phi don't add BOS token
* convert : revert "added_tokens_decoder" change
* phi-2 : scale Q instead of KQ for better precision
* ggml : fix NeoX rope to rotate just first n_dims
* cuda : less diff in the rope_neox kernel
* ggml : add ggml_mul_mat_set_prec
  ggml-ci
* Update ggml-cuda.cu
  Co-authored-by: slaren <slarengh@gmail.com>
* Update ggml-cuda.cu
  Co-authored-by: slaren <slarengh@gmail.com>
* cuda : ggml_cuda_op_mul_mat_cublas support F32 precision
* cuda : remove oboslete comment

---------

Co-authored-by: Ebey Abraham <ebeyabraham@microsoft.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Co-authored-by: slaren <slarengh@gmail.com>
Diffstat (limited to 'examples/llama.swiftui/llama.cpp.swift/LibLlama.swift')
0 files changed, 0 insertions, 0 deletions