author | Shijie <821898965@qq.com> | 2024-04-16 23:40:48 +0800 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-04-16 18:40:48 +0300 |
commit | f4dea7da1841a92d2788b0535063abf2f0e28461 (patch) | |
tree | c7a729d974e4315c71c78eea84fa08dda920b649 /common/json.hpp | |
parent | 8a56075b07a8b571bf95a912ffdce4c928c2b414 (diff) |
llama : add qwen2moe (#6074)
* support qwen2moe
* fix-review
* metal : support unary ops for nelements % 4 != 0
* metal : require contiguousness for float4 unary kernels
* metal : require contiguousness for float4 unary kernels (cont)
* fix-review
* names : for brevity "SHARED_EXP" -> "SHEXP"
* llama : reuse build_moe_ffn()
* llama : add model type name
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'common/json.hpp')
0 files changed, 0 insertions, 0 deletions