summaryrefslogtreecommitdiff
path: root/gguf-py/scripts/gguf-convert-endian.py
diff options
context:
space:
mode:
authorShintarou Okada <kokuzen@gmail.com>2023-12-24 22:35:49 +0900
committerGitHub <noreply@github.com>2023-12-24 15:35:49 +0200
commit753be377b69bda2d65a7e089f2b7f0c53ef3495e (patch)
treeb32ae0b6fb10db974322edeeb22021bc43d1e210 /gguf-py/scripts/gguf-convert-endian.py
parent5bf3953d7e9831ea22b0bc017ce97409b801ccf1 (diff)
llama : add PLaMo model (#3557)
* add plamo mock * add tensor loading * plamo convert * update norm * able to compile * fix norm_rms_eps hparam * runnable * use inp_pos * seems ok * update kqv code * remove develop code * update README * shuffle attn_q.weight and attn_output.weight for broadcasting * remove plamo_llm_build_kqv and use llm_build_kqv * fix style * update * llama : remove obsolete KQ_scale * plamo : fix tensor names for correct GPU offload --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Diffstat (limited to 'gguf-py/scripts/gguf-convert-endian.py')
0 files changed, 0 insertions, 0 deletions