ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2024-08-12	Merge mainline - Aug 12 2024 (#17)	Kawrakow
	* Merge mainline * Fix after merge * Remove CI check --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-07-27	Merge mainline llama.cpp (#3)	Kawrakow
	* Merging mainline - WIP * Merging mainline - WIP AVX2 and CUDA appear to work. CUDA performance seems slightly (~1-2%) lower as it is so often the case with llama.cpp/ggml after some "improvements" have been made. * Merging mainline - fix Metal * Remove check --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
2024-06-04	ggml : remove OpenCL (#7735)	Georgi Gerganov
	ggml-ci
2024-05-29	scripts : remove mpi remnants	Georgi Gerganov

2024-05-14	script : sync ggml-rpc	Georgi Gerganov

2024-04-09	license : update copyright notice + add AUTHORS (#6405)	Georgi Gerganov
	* license : add AUTHORS * authors : update * scipts : add LICENSE and gen-authors.sh to sync
2024-03-29	sync : ggml (#6351)	Georgi Gerganov
	* sync : ggml ggml-ci * cuda : move GGML_CUDA_DMMV constants to dmmv.cuh --------- Co-authored-by: slaren <slarengh@gmail.com>
2024-03-09	ggml : add ggml-common.h to deduplicate shared code (#5940)	Georgi Gerganov
	* ggml : add ggml-common.h to shared code ggml-ci * scripts : update sync scripts * sycl : reuse quantum tables ggml-ci * ggml : minor * ggml : minor * sycl : try to fix build
2024-02-10	scripts : update sync scripts with new backends	Georgi Gerganov

2023-12-07	sync : ggml (new ops, tests, backend, etc.) (#4359)	Georgi Gerganov
	* sync : ggml (part 1) * sync : ggml (part 2, CUDA) * sync : ggml (part 3, Metal) * ggml : build fixes ggml-ci * cuda : restore lost changes * cuda : restore lost changes (StableLM rope) * cmake : enable separable compilation for CUDA ggml-ci * ggml-cuda : remove device side dequantize * Revert "cmake : enable separable compilation for CUDA" This reverts commit 09e35d04b1c4ca67f9685690160b35bc885a89ac. * cuda : remove assert for rope * tests : add test-backend-ops * ggml : fix bug in ggml_concat * ggml : restore `ggml_get_n_tasks()` logic in `ggml_graph_plan()` * ci : try to fix macOS * ggml-backend : remove backend self-registration * ci : disable Metal for macOS cmake build ggml-ci * metal : fix "supports family" call * metal : fix assert * metal : print resource path ggml-ci --------- Co-authored-by: slaren <slarengh@gmail.com>
2023-11-13	sync : ggml (backend v2) (#3912)	Georgi Gerganov
	* sync : ggml (backend v2) (wip) * sync : migrate examples and llama.cpp to dynamic graphs (wip) * sync : update tests + fix max op params to 64 ggml-ci * sync : ggml-cuda ggml-ci * llama : fix save/load state context size ggml-ci * sync : try to fix build on tvOS * sync : pass custom graph sizes in training examples * sync : update graph copies to new ggml API * sync : update sync-ggml.sh with new files * scripts : fix header in sync script * train : fix context size calculations * llama : increase inference graph size up to 4096 nodes * train : allocate grads for backward graphs * train : allocate grads for gb_tmp
2023-10-08	sync : ggml (ggml-backend) (#3548)	Georgi Gerganov
	* sync : ggml (ggml-backend) ggml-ci * zig : add ggml-backend to the build
2023-08-22	ggml : sync latest (SAM + SD operators, CUDA alibi) (#2709)	Georgi Gerganov
	* ggml : sync latest (SAM + SD operators, CUDA alibi) ggml-ci * ggml : fix tabs
2023-08-02	tests : Fix compilation warnings (Linux/GCC) (#2451)	Eve
	* fix hellaswag print format, cast away warning in test-double-float * c++11 cannot use designated initializers * add static to test-grad0.c internal functions * use memcpy in test-double-float.c * port c tests to c++ * use initializer list for ggml_init_params
2023-07-05	tests : fix test-grad0	Georgi Gerganov

2023-07-04	ggml : sync latest (new ops, macros, refactoring) (#2106)	Georgi Gerganov
	- add ggml_argmax() - add ggml_tanh() - add ggml_elu() - refactor ggml_conv_1d() and variants - refactor ggml_conv_2d() and variants - add helper macros to reduce code duplication in ggml.c
2023-04-23	scripts : add helper scripts to synch ggml repo	Georgi Gerganov