Age         Commit message                                                                    Author
2023-10-28  llama : add option for greedy sampling with probs (#3813)  (Georgi Gerganov)
2023-10-28  common : print that one line of the syntax help *also* to standard output (#3...  (Henk Poley)
2023-10-28  starcoder : add GPU offloading (#3827)  (Georgi Gerganov)
2023-10-28  speculative : ensure draft and target model vocab matches (#3812)  (Kerfuffle)
2023-10-27  llama : correctly report GGUFv3 format (#3818)  (cebtenzzre)
2023-10-27  simple : fix batch handling (#3803)  (Thibault Terrasson)
2023-10-27  cuda : improve text-generation and batched decoding performance (#3776)  (Georgi Gerganov)
2023-10-26  server : do not release slot on image input (#3798)  (Georgi Gerganov)
2023-10-25  batched-bench : print params at start  (Georgi Gerganov)
2023-10-25  log : disable pid in log filenames  (Georgi Gerganov)
2023-10-24  server : add parameter -tb N, --threads-batch N (#3584) (#3768)  (cebtenzzre)
2023-10-24  server : do not block system prompt update (#3767)  (Georgi Gerganov)
2023-10-24  sync : ggml (conv ops + cuda MSVC fixes) (#3765)  (Georgi Gerganov)
2023-10-24  cmake : add missed dependencies (#3763)  (John Smith)
2023-10-24  cuda : add batched cuBLAS GEMM for faster attention (#3749)  (Georgi Gerganov)
2023-10-24  Add more tokenizer tests (#3742)  (Galunid)
2023-10-24  metal : handle ggml_scale for n%4 != 0 (close #3754)  (Georgi Gerganov)
2023-10-23  Revert "make : add optional CUDA_NATIVE_ARCH (#2482)"  (Georgi Gerganov)
2023-10-23  issues : separate bug and enhancement template + no default title (#3748)  (M. Yusuf Sarıgöz)
2023-10-23  Update special token handling in conversion scripts for gpt2 derived tokenize...  (Galunid)
2023-10-23  llama : remove token functions with `context` args in favor of `model` (#3720)  (Marcus Dunn)
2023-10-23  Fix baichuan convert script not detecting model (#3739)  (Galunid)
2023-10-22  make : add optional CUDA_NATIVE_ARCH (#2482)  (Alex)
2023-10-22  server : parallel decoding and multimodal (#3677)  (Georgi Gerganov)
2023-10-22  Add test for MPT tokenization (#3728)  (goerch)
2023-10-22  readme : remove unsupported node.js library (#3703)  (Ian Scrivener)
2023-10-22  llama : validate special token ids are in range when loading GGUF model (#3635)  (Kerfuffle)
2023-10-22  main : escape prompt for cfg_negative_prompt and consecutive inputs in main w...  (vvhg1)
2023-10-22  batched : add len CLI argument  (Georgi Gerganov)
2023-10-20  CLBlast: Add outer loops over src0 for broadcasting in mulmat  (shibe2)
2023-10-20  sampling : refactor init to use llama_sampling_params (#3696)  (Georgi Gerganov)
2023-10-20  gguf : support big endian platform (#3552)  (Qin Yue Chen)
2023-10-20  server : fix uninitialized sampling context (close #3685)  (Georgi Gerganov)
2023-10-20  ggml : fix rope + llama minor optimizations (#3560)  (Herman Semenov)
2023-10-20  convert : restore compat with old Falcon models (#3680)  (cebtenzzre)
2023-10-19  multimodal : add BakLLaVA conversion support (#3682)  (M. Yusuf Sarıgöz)
2023-10-19  llava : avoid segfault in case of non-existent mmproj file (#3674)  (M. Yusuf Sarıgöz)
2023-10-18  readme : update hot topics  (Georgi Gerganov)
2023-10-18  speculative : bug fixes  (Georgi Gerganov)
2023-10-18  speculative : add tree-based sampling example (#3624)  (Georgi Gerganov)
2023-10-18  metal : implement q5_0 and q5_1 kernels (#3648)  (Jhen-Jie Hong)
2023-10-18  opencl : fix element-wise multiplication (#3656)  (shibe2)
2023-10-17  fix embeddings when using CUDA (#3657)  (slaren)
2023-10-17  llama : avoid fprintf in favor of LLAMA_LOG (#3538)  (Georgi Gerganov)
2023-10-17  readme : update hot-topics & models, detail windows release in usage (#3615)  (BarfingLemurs)
2023-10-17  CLBlast: Fix temporary buffer size for f16 conversion (wsize)  (shibe2)
2023-10-17  train-text-from-scratch : fix assert failure in ggml-alloc (#3618)  (slaren)
2023-10-17  editorconfig : remove trailing spaces  (Georgi Gerganov)
2023-10-17  server : documentation of JSON return value of /completion endpoint (#3632)  (coezbek)
2023-10-17  save-load-state : fix example + add ci test (#3655)  (Georgi Gerganov)