summaryrefslogtreecommitdiff
path: root/examples/server
diff options
context:
space:
mode:
authorpostmasters <namnguyen@google.com>2024-02-21 05:08:22 -0800
committerGitHub <noreply@github.com>2024-02-21 15:08:22 +0200
commit580111d42b3b6ad0a390bfb267d6e3077506eb31 (patch)
tree9eed0a46aacfa10e586ea478c191106f11e59feb /examples/server
parent88c46cbdac05cebd936511b1d3c74112e721615f (diff)
llama : add `gemma` model (#5631)
There are couple things in this architecture: 1. Shared input and output embedding parameters. 2. Key length and value length are not derived from `n_embd`. More information about the models can be found at https://ai.google.dev/gemma. GGUFs can be downloaded from https://huggingface.co/google.
Diffstat (limited to 'examples/server')
0 files changed, 0 insertions, 0 deletions