ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

diff options

author	Shintarou Okada <kokuzen@gmail.com>	2023-12-24 22:35:49 +0900
committer	GitHub <noreply@github.com>	2023-12-24 15:35:49 +0200
commit	753be377b69bda2d65a7e089f2b7f0c53ef3495e (patch)
tree	b32ae0b6fb10db974322edeeb22021bc43d1e210 /gguf-py/scripts/gguf-convert-endian.py
parent	5bf3953d7e9831ea22b0bc017ce97409b801ccf1 (diff)

llama : add PLaMo model (#3557)

* add plamo mock * add tensor loading * plamo convert * update norm * able to compile * fix norm_rms_eps hparam * runnable * use inp_pos * seems ok * update kqv code * remove develop code * update README * shuffle attn_q.weight and attn_output.weight for broadcasting * remove plamo_llm_build_kqv and use llm_build_kqv * fix style * update * llama : remove obsolete KQ_scale * plamo : fix tensor names for correct GPU offload --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Diffstat (limited to 'gguf-py/scripts/gguf-convert-endian.py')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: