ik_llama.cpp.git

Age	Commit message	Author
2024-06-17	Add support for sqrt on CUDA (#7953)	Calvin Laurenson
2024-06-15	Add `cvector-generator` example (#7514)	Xuan Son Nguyen
2024-06-14	llama-bench : fix RPC indication (#7936)	Radoslav Gerganov
2024-06-13	move BLAS to a separate backend (#6210)	slaren
2024-06-13	`build`: rename main → llama-cli, server → llama-server, llava-cli → ll...	Olivier Chafik
2024-06-12	server : restore numeric prompts (#7883)	Georgi Gerganov
2024-06-11	llama-bench: more compact markdown tables (#7879)	Johannes Gäßler
2024-06-11	json: refine constraint for whitespace to avoid runaways yet allow pretty pri...	Olivier Chafik
2024-06-11	`json`: document schema conversion in GBNF readme, align manual grammar examp...	Olivier Chafik
2024-06-10	examples : remove --instruct remnants (#7846)	Georgi Gerganov
2024-06-10	server : improve "prompt" handling (#7847)	Georgi Gerganov
2024-06-09	imatrix : handle partial entries (#7833)	Georgi Gerganov
2024-06-09	server: do not remove whitespace at the start of a completion chunk (#7830)	mgroeber9110
2024-06-09	Revert "[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)" (#7808)	slaren
2024-06-08	server : smart slot selection using Longest Common Prefix (#7728)	sasha0552
2024-06-07	gguf-split : change binary multi-byte units to decimal (#7803)	Christian Zhou-Zheng
2024-06-07	server: update cache_prompt documentation [no ci] (#7745)	Johannes Gäßler
2024-06-07	server : do not get prompt in infill mode (#7286)	woodx
2024-06-07	check for nans in imatrix and quantize (#7807)	slaren
2024-06-06	imatrix : migrate to gpt_params (#7771)	Georgi Gerganov
2024-06-06	grammars: x{min,max} repetition operator (#6640)	Olivier Chafik
2024-06-05	ggml : refactor rope norm/neox (#7634)	Georgi Gerganov
2024-06-05	readme : remove -ins (#7759)	arch-btw
2024-06-04	common : refactor cli arg parsing (#7675)	Georgi Gerganov
2024-06-04	ggml : remove OpenCL (#7735)	Georgi Gerganov
2024-06-04	llama : remove beam search (#7736)	Georgi Gerganov
2024-06-04	llama-bench : allow using a different printer for stderr with -oe (#7722)	slaren
2024-06-02	[SYCL] Update rpc-server.cpp to include SYCL backend (#7682)	nickp27
2024-06-01	server : new UI (#7633)	Yazan Agha-Schrader
2024-06-02	SimpleChat: Simple histogram/repeatMatching driven garbageTrimming, Settings ...	HanishKVC
2024-05-31	server : update js (#7670)	Georgi Gerganov
2024-05-30	Move convert.py to examples/convert-legacy-llama.py (#7430)	Galunid
2024-05-29	llama-bench : add support for the RPC backend (#7435)	Radoslav Gerganov
2024-05-28	server: do not remove whitespace at the start of a completion chunk (#7524)	mgroeber9110
2024-05-28	Markdownish code block fix (#7571)	Nathan Epstein
2024-05-28	llava : update clip.h (#7580)	Ikko Eltociear Ashimine
2024-05-27	main: replace --no-special with --special (#7534)	Brian
2024-05-26	SimpleChat Completion Mode flexibility and cleanup, Settings gMe, Optional sl...	HanishKVC
2024-05-25	train : change default FA argument (#7528)	Georgi Gerganov
2024-05-25	main : don't print special tokens with --grammar (#6923)	Justine Tunney
2024-05-25	android : module (#7502)	Elton Kola
2024-05-25	Make tokenize CLI tool have nicer command line arguments. (#6188)	Mikko Juola
2024-05-24	add build shared lib in win release package (#7438)	Neo Zhang
2024-05-23	ggml : remove ggml_flash_attn and ggml_flash_ff (#7463)	Georgi Gerganov
2024-05-23	main : minor (#7462)	Georgi Gerganov
2024-05-23	SimpleChat: a simple and dumb web front end for testing /chat/completions and...	HanishKVC
2024-05-22	common : normalize naming style (#7462)	Georgi Gerganov
2024-05-22	phi3 : duplicate rope factors in each layer (#7447)	slaren
2024-05-21	llama : add phi3 128K model support (#7225)	liuwei-git
2024-05-21	`grammars`: fix resampling logic regression (#7424)	Olivier Chafik