diff options
author | Georgi Gerganov <ggerganov@gmail.com> | 2023-09-07 15:49:09 +0300 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-09-07 15:49:09 +0300 |
commit | c4f496648c1e32efeb714200e7eae7fc7cfbb223 (patch) | |
tree | 876320eb5fa8b02682e0b0d88fe325b40da2f23a /llama.cpp | |
parent | fec2fb19e4229aac58c98171c46e77144b99f8a3 (diff) |
metal : fix kernel_norm (fixes Falcon on Metal) (#3057)
* metal : fix kernel_norm
ggml-ci
* metal : put warning in kernel_norm to not combine the loops
* metal : restore original F16 mat-vec multiplication
It works after the norm fixes
* common : don't do warm-up with more than n_batch tokens (close #3058)
ggml-ci
* metal : minor
Diffstat (limited to 'llama.cpp')
0 files changed, 0 insertions, 0 deletions