| author | fairydreaming <166155368+fairydreaming@users.noreply.github.com> | 2024-05-23 11:49:53 +0200 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2024-05-23 11:49:53 +0200 |
| commit | 9b82476ee9e73065a759f8bcc4cf27ec7ab2ed8c | |
| tree | d4881d12bc7e60750f90e642e3fabbdf4029fc53 /gguf-py | |
| parent | a61a94e543e3c6877c087e80fca27a0313ce5fd5 | |
Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461)
* convert-hf : add conversion of bloom-style qkv tensor to gpt-style qkv (code borrowed from BloomModel)
* llama : add inference support for LLM_ARCH_GPTNEOX
* llama : add model types for every Pythia variant and GPT-NeoX
Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
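
For readers unfamiliar with the first item of the commit message, the following is a minimal sketch of what a "bloom-style to gpt-style" QKV reordering can look like. It is not the code from this commit. It assumes the fused tensor stores its rows grouped per attention head as [Q, K, V], and that the target layout stores all Q rows, then all K rows, then all V rows. The function name `interleaved_to_blocked_qkv` is hypothetical, and plain Python lists stand in for tensor rows so the example has no dependencies.

```python
def interleaved_to_blocked_qkv(rows, n_head):
    """Reorder fused QKV rows from a per-head layout to a blocked layout.

    Assumed input layout (per head):  [Q_h0, K_h0, V_h0, Q_h1, K_h1, V_h1, ...]
    Assumed output layout (blocked):  [Q_h0, Q_h1, ..., K_h0, K_h1, ..., V_h0, V_h1, ...]

    `rows` is a list whose items stand in for rows of the weight matrix.
    """
    total = len(rows)
    if n_head <= 0 or total % (3 * n_head) != 0:
        raise ValueError("row count must be a multiple of 3 * n_head")
    head_dim = total // (3 * n_head)

    out = []
    for part in range(3):  # 0 = Q, 1 = K, 2 = V
        for head in range(n_head):
            # Offset of this head's Q, K, or V slice in the per-head layout.
            start = (head * 3 + part) * head_dim
            out.extend(rows[start:start + head_dim])
    return out


# Illustrative input: 2 heads, head_dim 2, labels instead of real weights.
labels = [
    f"{part}{head}_{d}"
    for head in range(2)
    for part in "qkv"
    for d in range(2)
]
blocked = interleaved_to_blocked_qkv(labels, n_head=2)
```

With real weights the same permutation would normally be expressed as a reshape followed by a concatenation along the row axis. The loop form is used here only to make the index arithmetic explicit.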
Diffstat (limited to 'gguf-py')
0 files changed, 0 insertions, 0 deletions
