ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2025-02-23	Add new sweep-bench benchmark (#225)	saood06
2024-07-27	Merge mainline llama.cpp (#3)	Kawrakow
2024-06-15	Add `cvector-generator` example (#7514)	Xuan Son Nguyen
2024-06-13	`build`: rename main → llama-cli, server → llama-server, llava-cli → ll...	Olivier Chafik
2024-06-04	llama : remove beam search (#7736)	Georgi Gerganov
2024-05-14	ggml : add RPC backend (#6829)	Radoslav Gerganov
2024-04-11	eval-callback: Example how to use eval callback for debugging (#6576)	Pierrick Hymbert
2024-03-25	examples : add "retrieval" (#6193)	Minsoo Cheong
2024-03-19	gguf-split: split and merge gguf per batch of tensors (#6135)	Pierrick Hymbert
2024-03-10	llama : add support for GritLM (#5959)	DAN™
2024-02-13	gguf : add python reader example (#5216)	John
2024-01-28	ggml : add unified SYCL backend for Intel GPUs (#2690)	Abhilash Majumder
2024-01-13	metal : remove old API (#4919)	Georgi Gerganov
2024-01-12	Importance Matrix calculation (#4861)	Kawrakow
2024-01-08	examples : add passkey test (#3856)	Georgi Gerganov
2023-12-22	lookup : add prompt lookup decoding example (#4484)	LeonEricsson
2023-11-26	lookahead : add example for lookahead decoding (#4207)	Georgi Gerganov
2023-11-17	examples : add tokenize (#4039)	zakkor
2023-10-20	sampling : refactor init to use llama_sampling_params (#3696)	Georgi Gerganov
2023-10-12	examples: support LLaVA v1.5 (multimodal model) (#3436)	M. Yusuf Sarıgöz
2023-10-11	batched : add bench tool (#3545)	Georgi Gerganov
2023-09-28	train : finetune LORA (#2632)	xaedes
2023-09-28	llama : custom attention mask + parallel decoding + no context swaps (#3228)	Georgi Gerganov
2023-09-03	speculative : PoC for speeding-up inference via speculative sampling (#2926)	Georgi Gerganov
2023-08-30	examples : fix underscore in beam-search + .gitignore (close #2900)	Georgi Gerganov
2023-08-25	llama : add llama_beam_search() (#2267)	Matt Pulver
2023-08-21	gguf : new file format with flexible meta data (beta) (#2398)	Georgi Gerganov
2023-08-18	llama : add benchmark example (#2626)	slaren
2023-08-12	Adding support for llama2.c models (#2559)	byte-6174
2023-08-04	Add --simple-io option for subprocesses and break out console.h and cpp (#1558)	DannyDaemonic
2023-07-23	llama : add grammar-based sampling (#1773)	Evan Jones
2023-06-28	llama : support input embeddings directly (#1910)	ningshanwutuobang
2023-06-17	llama : fix kv_cache `n` init (close #1903)	Georgi Gerganov
2023-06-13	train : improved training-from-scratch example (#1652)	xaedes
2023-06-04	llama : Metal inference (#1642)	Georgi Gerganov
2023-05-21	examples : add server example with REST API (#1443)	Steward Garcia
2023-05-13	ggml : implement backward pass for llama + small training-llama-from-scratch ...	xaedes
2023-04-30	Various fixes to mat_mul benchmark (#1253)	Stephan Walter
2023-04-24	examples : add save_load_state example (#1150)	xaedes
2023-04-08	Add quantize-stats command for testing quantization (#728)	unbounded
2023-03-25	Overhaul the examples structure	Georgi Gerganov