diff options
author | Steffen Röcker <sroecker@gmail.com> | 2024-05-18 10:04:55 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-05-18 11:04:55 +0300 |
commit | 0f98acfac6cc561dc57586bfff778405e42b576b (patch) | |
tree | 5ced0f623f9124ae87bc02566bf717636fbfbbac /examples/retrieval | |
parent | ca57e0f35e33f714b9a6c2c4482b87bfe059c819 (diff) |
llama : add support for larger Granite Code Models (20B, 34B) (#7324)
Tie the weights for ARCH_STARCODER to support the larger Granite code models.
Partially addresses ggerganov/issues/7116
There still remains to be a few things to fix.
Currently requires `--override-kv tokenizer.ggml.add_bos_token=bool:false`
Diffstat (limited to 'examples/retrieval')
0 files changed, 0 insertions, 0 deletions