diff options
| author | Steffen Röcker <sroecker@gmail.com> | 2024-05-18 10:04:55 +0200 |
|---|---|---|
| committer | GitHub <noreply@github.com> | 2024-05-18 11:04:55 +0300 |
| commit | 0f98acfac6cc561dc57586bfff778405e42b576b (patch) | |
| tree | 5ced0f623f9124ae87bc02566bf717636fbfbbac /examples/llava/llava.cpp | |
| parent | ca57e0f35e33f714b9a6c2c4482b87bfe059c819 (diff) | |
llama : add support for larger Granite Code Models (20B, 34B) (#7324)
Tie the weights for ARCH_STARCODER to support the larger Granite code models.
Partially addresses ggerganov/issues/7116
There still remains to be a few things to fix.
Currently requires `--override-kv tokenizer.ggml.add_bos_token=bool:false`
Diffstat (limited to 'examples/llava/llava.cpp')
0 files changed, 0 insertions, 0 deletions
