ik_llama.cpp.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Expand)	Author
2024-05-03	llama : rename ctx to user_data in progress_callback (#7045)	Daniel Bevenius
2024-05-03	Remove .attention from skipped tensors to match more accurately (#7051)	Bartowski
2024-05-02	chore: fix typo in llama.cpp (#7032)	alwqx
2024-05-01	Update LOG_IMPL and LOG_TEE_IMPL (#7029)	Andrew Downing
2024-05-01	main : fix off by one error for context shift (#6921)	l3utterfly
2024-05-01	Server: add tests for batch size, different seeds (#6950)	Johannes Gäßler
2024-05-01	CUDA: CUDART < 11.7 workaround for __hmax, __hmax2 (#7019)	Johannes Gäßler
2024-05-01	ci : exempt confirmed bugs from being tagged as stale (#7014)	slaren
2024-04-30	perplexity: more statistics, added documentation (#6936)	Johannes Gäßler
2024-04-30	switch to using localizedDescription (#7010)	Kevin Gibbons
2024-04-30	metal : remove deprecated error code (#7008)	Georgi Gerganov
2024-04-30	metal : log more info on error (#6987)	Kevin Gibbons
2024-04-30	ggml : add Flash Attention (#5021)	Georgi Gerganov
2024-04-30	convert : use utf8 encoding (#7000)	Georgi Gerganov
2024-04-30	Improve usability of --model-url & related flags (#6930)	Olivier Chafik
2024-04-29	Extending grammar integration tests (#6644)	Clint Herron
2024-04-29	main : fix typo in comment in main.cpp (#6985)	Daniel Bevenius
2024-04-29	build(cmake): simplify instructions (`cmake -B build && cmake --build build ....	Olivier Chafik
2024-04-29	ci : tmp disable gguf-split (#6983)	Georgi Gerganov
2024-04-29	ggml : fix __MSC_VER -> _MSC_VER (#6977)	Georgi Gerganov
2024-04-29	llava-cli : multiple images (#6969)	cpumaxx
2024-04-29	readme : update hot topics	Georgi Gerganov
2024-04-29	llama : fix BPE pre-tokenization (#6920)	Georgi Gerganov
2024-04-29	sampling : use std::random_device{}() for default random seed (#6962)	David Renshaw
2024-04-29	convert : fix conversion of some BERT embedding models (#6937)	Christian Zhou-Zheng
2024-04-29	make : change GNU make default CXX from g++ to c++ (#6966)	Przemysław Pawełczyk
2024-04-29	ci : add building in MSYS2 environments (Windows) (#6967)	Przemysław Pawełczyk
2024-04-29	llama : fix typo LAMMAFILE -> LLAMAFILE (#6974)	Johannes Gäßler
2024-04-29	Fix more int overflow during quant (PPL/CUDA). (#6563)	DAN™
2024-04-28	gguf : enforce that tensor names are unique (#6905)	Xuan Son Nguyen
2024-04-28	add device version in device list (#6959)	Neo Zhang
2024-04-28	flake.lock: Update	github-actions[bot]
2024-04-27	Replace "alternative" boolean operator in conditional compilation directive (...	mgroeber9110
2024-04-27	ci: server: tests python env on github container ubuntu latest / fix n_predic...	Pierrick Hymbert
2024-04-26	Reset schedule earlier to allow overlap with ggml graph computation on device...	agray3
2024-04-26	quantize: add imatrix and dataset metadata in GGUF (#6658)	Pierrick Hymbert
2024-04-26	add basic tensor data validation function (#6884)	slaren
2024-04-26	gguf : fix mismatch between alloc and free functions (#6929)	slaren
2024-04-26	llamafile : use 64-bit integers in sgemm (#6928)	Justine Tunney
2024-04-26	ci: server: fix python installation (#6925)	Pierrick Hymbert
2024-04-26	server: stop generation at `n_ctx_train` if `n_predict` is not set (#6638)	Pierrick Hymbert
2024-04-26	ci: server: fix python installation (#6922)	Pierrick Hymbert
2024-04-26	Merge pull request from GHSA-p5mv-gjc5-mwqv	Georgi Gerganov
2024-04-26	ci: server: fix python installation (#6918)	Pierrick Hymbert
2024-04-26	ci: fix concurrency for pull_request_target (#6917)	Pierrick Hymbert
2024-04-26	bench: server add stop word for PHI-2 (#6916)	Pierrick Hymbert
2024-04-25	llava : add support for moondream vision language model (#6899)	vik
2024-04-25	cmake : restore LLAMA_LLAMAFILE_DEFAULT	Georgi Gerganov
2024-04-25	cmake : remove obsolete ANDROID check	Georgi Gerganov
2024-04-25	llama : synchronize before get/set session data (#6911)	slaren