ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2024-03-02	scripts : add pod-llama.sh	Georgi Gerganov

2024-03-01	llama : cleanup unused mmq flags (#5772)	Pierrick Hymbert
	* cleanup unused --no-mul-mat-q,-nommq, -mmq, --mul-mat-q, mul_mat_q * remove: mul_mat_q in compare llama bench and usage * update llama-bench --------- Co-authored-by: slaren <slarengh@gmail.com>
2024-02-28	sync : ggml	Georgi Gerganov

2024-02-22	sync : ggml	Georgi Gerganov

2024-02-21	sync : ggml	Georgi Gerganov

2024-02-21	sync : ggml (#5633)	Georgi Gerganov
	* ggml : fix conv_2d batch mode (ggml/737) Co-authored-by: bssrdf <bssrdf@gmail.com> * ggml : compute forward no longer pass src tensors (ggml/729) * sync : ggml ggml-ci --------- Co-authored-by: bssrdf <merlintiger@hotmail.com> Co-authored-by: bssrdf <bssrdf@gmail.com>
2024-02-19	sync : ggml	Georgi Gerganov
	ggml-ci
2024-02-18	build : pass all warning flags to nvcc via -Xcompiler (#5570)	Jared Van Bortel
	* build : pass all warning flags to nvcc via -Xcompiler * make : fix apparent mis-merge from #3952 * make : fix incorrect GF_CC_VER for CUDA host compiler
2024-02-18	ci : fix wikitext url + compile warnings (#5569)	Georgi Gerganov
	ggml-ci
2024-02-16	scripts : add helpers script for bench comparing commits (#5521)	Georgi Gerganov
	* scripts : add helpers script for bench comparing commits * scripts : detect CUDA * set flags after checking the command line * fix make flags --------- Co-authored-by: slaren <slarengh@gmail.com>
2024-02-15	scripts : add hf.sh helper script (#5501)	Georgi Gerganov
	* scripts : add hf.sh helper scripts * hf : add error logs * hf : add support for --repo and --file
2024-02-12	sync : ggml (#5452)	Georgi Gerganov
	* ggml-alloc : v3 (ggml/727) * ggml-alloc v3 ggml-ci * fix ci ggml-ci * whisper : check for backend buffer allocation failures * whisper : avoid leaks when initialization fails * cleanup ggml-ci * style fixes ggml-ci * sync : ggml * update llama.cpp, clip.cpp, export-lora.cpp * update finetune.cpp, train-text-from-scratch.cpp ggml-ci * ggml-backend : reduce alignment to 32 to match gguf and fix mmap --------- Co-authored-by: slaren <slarengh@gmail.com>
2024-02-10	scripts : update sync scripts with new backends	Georgi Gerganov

2024-02-10	sync : ggml	Georgi Gerganov

2024-02-05	scripts : fix typos, cleanup (#5303)	Georgi Gerganov

2024-02-05	scripts : add non-interactive server-llm.sh (#5303)	Нияз Гарифзянов
	* Update server-llm.sh Add flag --non-interactive that allows run script without asking a permission * Update scripts/server-llm.sh --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2024-02-02	scripts : parse wtype in server-llm.sh (#5167)	Georgi Gerganov
	* scripts : parse wtype in server-llm.sh * scripts : fix check for wfile
2024-01-31	support SYCL backend windows build (#5208)	Neo Zhang Jianyu
	* support SYCL backend windows build * add windows build in CI * add for win build CI * correct install oneMKL * fix install issue * fix ci * fix install cmd * fix install cmd * fix install cmd * fix install cmd * fix install cmd * fix win build * fix win build * fix win build * restore other CI part * restore as base * rm no new line * fix no new line issue, add -j * fix grammer issue * allow to trigger manually, fix format issue * fix format * add newline * fix format * fix format * fix format issuse --------- Co-authored-by: Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com>
2024-01-30	sync : ggml (#0)	Georgi Gerganov

2024-01-28	sync : ggml	Georgi Gerganov

2024-01-27	sync : ggml	Georgi Gerganov

2024-01-26	scripts : move run-with-preset.py from root to scripts folder	Georgi Gerganov

2024-01-26	ci : add model tests + script wrapper (#4586)	crasm
	* scripts : add lib.sh and lib_test.sh * scripts : stub out new ci-run.sh script * scripts : switch to PascalCase for functions This looks a little odd at first, but I find it very useful as a convention to know if a command is part of our code vs a builtin. * scripts : add some fancy conversion from snake_case to PascalCase * Add venv to ci/run.sh * Revert scripts work * scripts : add wrapper script for local use of ci/run.sh * Simplify .gitignore for tests, clang-tidy fixes * Label all ctest tests * ci : ctest uses -L main * Attempt at writing ctest_with_model * Update test-model-load-cancel * ci : add ctest_with_model for debug and release ggml-ci * Fix gg_get_model function ggml-ci * got stuck on CMake * Add get_model.cpp to tests/CMakeLists.txt ggml-ci * Fix README.md output for ctest_with_model ggml-ci * workflows : use `-L main` for all ctest ggml-ci * Fixes * GG_RUN_CTEST_MODELFILE => LLAMACPP_TESTMODELFILE * Always show warning rather than failing if model file variable is not set * scripts : update usage text for ci-run.sh
2024-01-18	scripts : add get-winogrande.sh	Georgi Gerganov

2024-01-18	scritps : add helper script to get hellaswag data in txt format	Georgi Gerganov

2024-01-17	sync : ggml	Georgi Gerganov

2024-01-14	scripts : sync-ggml-am.sh option to skip commits	Georgi Gerganov

2024-01-14	sync : ggml	Georgi Gerganov

2024-01-13	compare-llama-bench: tweak output format (#4910)	Johannes Gäßler

2024-01-12	sync : ggml	Georgi Gerganov

2024-01-11	sync : ggml	Georgi Gerganov

2024-01-10	Python script to compare commits with llama-bench (#4844)	Johannes Gäßler

2024-01-09	scripts : improve get-pg.sh (#4838)	Georgi Gerganov

2024-01-09	scripts : script to get Paul Graham essays in txt format (#4838)	Georgi Gerganov

2024-01-05	metal : switch back to default.metallib (ggml/681)	Georgi Gerganov
	ggml-ci
2024-01-03	cuda : simplify expression	Georgi Gerganov
	Co-authored-by: slaren <slarengh@gmail.com>
2024-01-03	sync : ggml	Georgi Gerganov
	ggml-ci
2024-01-03	scripts : fix sync order + metal sed	Georgi Gerganov

2023-12-29	python : add check-requirements.sh and GitHub workflow (#4585)	crasm
	* python: add check-requirements.sh and GitHub workflow This script and workflow forces package versions to remain compatible across all convert.py scripts, while allowing secondary convert scripts to import dependencies not wanted in convert.py. Move requirements into ./requirements * Fail on "==" being used for package requirements (but can be suppressed) * Enforce "compatible release" syntax instead of == * Update workflow * Add upper version bound for transformers and protobuf * improve check-requirements.sh * small syntax change * don't remove venvs if nocleanup is passed * See if this fixes docker workflow * Move check-requirements.sh into ./scripts/ --------- Co-authored-by: Jared Van Bortel <jared@nomic.ai>
2023-12-29	scripts : print list of sync commits	Georgi Gerganov

2023-12-29	sync : ggml	Georgi Gerganov

2023-12-29	scripts : do not sync commits from this repo	Georgi Gerganov

2023-12-27	scripts : add sync-ggml-am.sh	Georgi Gerganov

2023-12-13	build : detect host compiler and cuda compiler separately (#4414)	Jared Van Bortel

2023-12-07	sync : ggml (new ops, tests, backend, etc.) (#4359)	Georgi Gerganov
	* sync : ggml (part 1) * sync : ggml (part 2, CUDA) * sync : ggml (part 3, Metal) * ggml : build fixes ggml-ci * cuda : restore lost changes * cuda : restore lost changes (StableLM rope) * cmake : enable separable compilation for CUDA ggml-ci * ggml-cuda : remove device side dequantize * Revert "cmake : enable separable compilation for CUDA" This reverts commit 09e35d04b1c4ca67f9685690160b35bc885a89ac. * cuda : remove assert for rope * tests : add test-backend-ops * ggml : fix bug in ggml_concat * ggml : restore `ggml_get_n_tasks()` logic in `ggml_graph_plan()` * ci : try to fix macOS * ggml-backend : remove backend self-registration * ci : disable Metal for macOS cmake build ggml-ci * metal : fix "supports family" call * metal : fix assert * metal : print resource path ggml-ci --------- Co-authored-by: slaren <slarengh@gmail.com>
2023-11-27	cmake : fix issue with version info not getting baked into LlamaConfig.cmake ↵	bandoti
	(#3970) * Split CPP generation from build-info query * Remove blank lines * Add BUILD_SHARED_LIBS option
2023-11-13	sync : ggml (backend v2) (#3912)	Georgi Gerganov
	* sync : ggml (backend v2) (wip) * sync : migrate examples and llama.cpp to dynamic graphs (wip) * sync : update tests + fix max op params to 64 ggml-ci * sync : ggml-cuda ggml-ci * llama : fix save/load state context size ggml-ci * sync : try to fix build on tvOS * sync : pass custom graph sizes in training examples * sync : update graph copies to new ggml API * sync : update sync-ggml.sh with new files * scripts : fix header in sync script * train : fix context size calculations * llama : increase inference graph size up to 4096 nodes * train : allocate grads for backward graphs * train : allocate grads for gb_tmp
2023-11-02	build : link against build info instead of compiling against it (#3879)	cebtenzzre
	* cmake : fix build when .git does not exist * cmake : simplify BUILD_INFO target * cmake : add missing dependencies on BUILD_INFO * build : link against build info instead of compiling against it * zig : make build info a .cpp source instead of a header Co-authored-by: Matheus C. França <matheus-catarino@hotmail.com> * cmake : revert change to CMP0115 --------- Co-authored-by: Matheus C. França <matheus-catarino@hotmail.com>
2023-11-01	scripts : add server-llm.sh (#3868)	Georgi Gerganov
	* scripts : add deploy-server.sh * scripts : rename to server-llm.sh * scripts : working curl pipe
2023-10-08	sync : ggml (ggml-backend) (#3548)	Georgi Gerganov
	* sync : ggml (ggml-backend) ggml-ci * zig : add ggml-backend to the build