ik_llama.cpp.git

Age	Commit message	Author
2025-07-14	Adding IQ2_KL (#602)	Kawrakow
2025-05-23	Fix MSVC compilation (#448)	Kawrakow
2025-05-23	Fix typo in non-AVX2 code branch (#445)	Kawrakow
2025-05-23	Trellis quants with CPU inference (#441)	Andrew Chan
2025-04-07	Add copyright notices (#317)	Kawrakow
2024-10-16	Adding IQ4_KSS: 4.0 bpw quants (#89)	Kawrakow
2024-10-13	IQ2_KS: 2.1875 bpw non-linear quantization (#85)	Kawrakow
2024-10-10	Better model info (#84)	Kawrakow
2024-08-19	quantize_stats: print rmse and max error as fraction of <x> (#21)	Kawrakow
2024-07-27	Merge mainline llama.cpp (#3)	Kawrakow
2024-06-22	bitnet: add 2 bpw quantization	Iwan Kawrakow
2024-04-30	Improve usability of --model-url & related flags (#6930)	Olivier Chafik
2024-02-03	refactor : switch to emplace_back to avoid extra object (#5291)	Michael Klimenko
2024-01-30	SOTA 3-bit quants (#5196)	Kawrakow
2023-12-07	llama : per-layer KV cache + quantum K cache (#4309)	Georgi Gerganov
2023-11-02	build : link against build info instead of compiling against it (#3879)	cebtenzzre
2023-09-28	llama.cpp : split llama_context_params into model and context params (#3301)	slaren
2023-09-18	make : restore build-info.h dependency for several targets (#3205)	Cebtenzzre
2023-09-15	examples : add compiler version and target to build info (#2998)	Cebtenzzre
2023-09-15	check C++ code with -Wmissing-declarations (#3184)	Cebtenzzre
2023-09-07	fix some warnings from gcc and clang-tidy (#3038)	Cebtenzzre
2023-08-21	gguf : new file format with flexible meta data (beta) (#2398)	Georgi Gerganov
2023-07-05	ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)	Stephan Walter
2023-06-24	llama : make model stateless and context stateful (llama_state) (#1797)	Didzis Gosko
2023-06-16	build : fix and ignore MSVC warnings (#1889)	Borislav Stanimirov
2023-06-05	ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)	Kawrakow
2023-05-17	Remove unused n_parts parameter (#1509)	Stephan Walter
2023-05-01	Add git-based build information for better issue tracking (#1232)	DannyDaemonic
2023-04-20	llama : multi-threaded quantization (#1075)	Kawrakow
2023-04-17	quantize-stats : fix bug in --type argument	Georgi Gerganov
2023-04-14	Expose type name from ggml (#970)	Pavol Rusnak
2023-04-13	llama : merge llama_internal.h into llama.h	Georgi Gerganov
2023-04-10	Rewrite loading code to try to satisfy everyone:	comex
2023-04-08	Add quantize-stats command for testing quantization (#728)	unbounded