Age | Commit message (Expand) | Author |
---|---|---|
2024-01-19 | llama : support upcoming Qwen2 (#5037) | Shijie |
2024-01-19 | llama : add CodeShell support (#5016) | chiranko |
2024-01-13 | convert : update phi-2 to latest HF repo (#4903) | Georgi Gerganov |
2024-01-02 | llama : differentiate the KV dims in the attention (#4657) | postmasters |
2023-12-28 | gpt2 : Add gpt2 architecture integration (#4555) | manikbhandari |
2023-12-27 | llama : add AWQ for llama, llama2, mpt, and mistral models (#4593) | Nam D. Tran |
2023-12-24 | llama : add PLaMo model (#3557) | Shintarou Okada |
2023-12-18 | llama : add phi-2 + fix NeoX rope + ggml_mul_mat_set_prec (#4490) | Ebey Abraham |
2023-12-13 | llama : add Mixtral support (#4406) | slaren |
2023-12-01 | llama : add Qwen support (#4281) | Shijie |
2023-11-19 | gguf-py : export chat templates (#4125) | slaren |
2023-11-14 | stablelm : StableLM support (#3586) | Galunid |
2023-11-11 | gguf-py: Refactor and allow reading/modifying existing GGUF files (#3981) | Kerfuffle |