path: root/scripts
Age | Commit message | Author
2024-08-12 | Merge mainline - Aug 12 2024 (#17) | Kawrakow
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow
2024-06-18 | tokenizer : BPE fixes (#7530) | jaime-m-p
2024-06-18 | ggml : sync | Georgi Gerganov
2024-06-13 | `build`: rename main → llama-cli, server → llama-server, llava-cli → ll... | Olivier Chafik
2024-06-04 | common : refactor cli arg parsing (#7675) | Georgi Gerganov
2024-06-04 | ggml : remove OpenCL (#7735) | Georgi Gerganov
2024-06-04 | llama-bench : allow using a different printer for stderr with -oe (#7722) | slaren
2024-05-31 | scripts: update compare_llama_bench.py [no ci] (#7673) | Johannes Gäßler
2024-05-30 | Move convert.py to examples/convert-legacy-llama.py (#7430) | Galunid
2024-05-29 | scripts : remove mpi remnants | Georgi Gerganov
2024-05-29 | sync : ggml | Georgi Gerganov
2024-05-20 | llama : remove MPI backend (#7395) | slaren
2024-05-18 | Unicode codepoint flags for custom regexs (#7245) | jaime-m-p
2024-05-17 | Added a single test function script and fix debug-test.sh to be more robust (... | Brian
2024-05-15 | sync : ggml | Georgi Gerganov
2024-05-14 | script : sync ggml-rpc | Georgi Gerganov
2024-05-14 | sync : ggml | Georgi Gerganov
2024-05-11 | metal : fix warnings (skipme) (#0) | Georgi Gerganov
2024-05-11 | sync : ggml | Georgi Gerganov
2024-05-12 | Scripting & documenting debugging one test without anything else in the loop.... | Josh Ramer
2024-05-11 | sync : ggml | Georgi Gerganov
2024-05-10 | llama-bench : add pp+tg test type (#7199) | slaren
2024-05-09 | llama3 custom regex split (#6965) | jaime-m-p
2024-05-08 | compare-llama-bench.py: add missing basicConfig (#7138) | Brian
2024-05-05 | py : logging and flake8 suppression refactoring (#7081) | Brian
2024-05-04 | tests : add test-tokenizer-0.sh + fix some tokenizers (#7036) | Georgi Gerganov
2024-05-03 | convert.py : add python logging instead of print() (#6511) | Brian
2024-04-29 | llama : fix BPE pre-tokenization (#6920) | Georgi Gerganov
2024-04-21 | `build`: generate hex dump of server assets during build (#6661) | Olivier Chafik
2024-04-18 | ggml : group all experts in a single ggml_mul_mat_id (#6505) | slaren
2024-04-13 | model: support arch `DbrxForCausalLM` (#6515) | Pierrick Hymbert
2024-04-11 | scripts : add --outdir option to hf.sh (#6600) | Daniel Bevenius
2024-04-09 | sync : ggml | Georgi Gerganov
2024-04-09 | license : update copyright notice + add AUTHORS (#6405) | Georgi Gerganov
2024-04-07 | sync : ggml | Georgi Gerganov
2024-04-07 | scripts : sync ggml-cuda folder | Georgi Gerganov
2024-04-06 | sync : ggml | Georgi Gerganov
2024-04-01 | compare-llama-bench.py: fix long hexsha args (#6424) | Johannes Gäßler
2024-03-29 | sync : ggml (#6351) | Georgi Gerganov
2024-03-26 | cuda : rename build flag to LLAMA_CUDA (#6299) | slaren
2024-03-23 | lookup: complement data from context with general text statistics (#5479) | Johannes Gäßler
2024-03-10 | sync : ggml | Georgi Gerganov
2024-03-09 | ggml : add ggml-common.h to deduplicate shared code (#5940) | Georgi Gerganov
2024-03-05 | compare-llama-bench.py : remove mul_mat_q (#5892) | slaren
2024-03-04 | sync : ggml | Georgi Gerganov
2024-03-04 | sync : ggml | Georgi Gerganov
2024-03-02 | scripts : add pod-llama.sh | Georgi Gerganov
2024-03-01 | llama : cleanup unused mmq flags (#5772) | Pierrick Hymbert
2024-02-28 | sync : ggml | Georgi Gerganov