Age | Commit message (Expand) | Author |
---|---|---|
2023-09-28 | llama.cpp : split llama_context_params into model and context params (#3301) | slaren |
2023-09-28 | llama : custom attention mask + parallel decoding + no context swaps (#3228) | Georgi Gerganov |
2023-09-15 | examples : add compiler version and target to build info (#2998) | Cebtenzzre |
2023-09-08 | build : do not use _GNU_SOURCE gratuitously (#2035) | Przemysław Pawełczyk |
2023-08-27 | llama : more tokenizer fixes (#2810) | Georgi Gerganov |
2023-08-21 | gguf : new file format with flexible meta data (beta) (#2398) | Georgi Gerganov |
2023-08-04 | build : fix several cast and printf warnings (#2499) | Borislav Stanimirov |
2023-07-10 | mpi : add support for distributed inference via MPI (#2099) | Evan Miller |
2023-06-26 | ggml : add NUMA support (#1556) | zrm |
2023-06-24 | llama : make model stateless and context stateful (llama_state) (#1797) | Didzis Gosko |
2023-06-16 | examples : add "simple" (#1840) | SuperUserNameMan |