summaryrefslogtreecommitdiff
path: root/README.md
diff options
context:
space:
mode:
authornopperl <54780682+nopperl@users.noreply.github.com>2024-04-19 09:35:54 +0000
committerGitHub <noreply@github.com>2024-04-19 11:35:54 +0200
commit9958c81b798a5872087b30b360e4674871f2479e (patch)
tree4f5bb8ee68ce6da0bdb070dbfb329de333377190 /README.md
parent8b1b1f4982d3e9b994308d05a1c8b9e45c23edb5 (diff)
Implement the OLMo architecture (#6741)
* implement olmo architecture * remove unused variable * remove unused moe branch * remove check for weight * remove superfluous moe, bias and rope tensors * clarified comment * fix clamp_kqv setting * remove obsolete parameter name filter
Diffstat (limited to 'README.md')
-rw-r--r--README.md1
1 files changed, 1 insertions, 0 deletions
diff --git a/README.md b/README.md
index e3eae60b..08661831 100644
--- a/README.md
+++ b/README.md
@@ -122,6 +122,7 @@ Typically finetunes of the base models below are supported as well.
- [x] [Command-R](https://huggingface.co/CohereForAI/c4ai-command-r-v01)
- [x] [SEA-LION](https://huggingface.co/models?search=sea-lion)
- [x] [GritLM-7B](https://huggingface.co/GritLM/GritLM-7B) + [GritLM-8x7B](https://huggingface.co/GritLM/GritLM-8x7B)
+- [x] [OLMo](https://allenai.org/olmo)
(instructions for supporting more models: [HOWTO-add-model.md](./docs/HOWTO-add-model.md))