* Added Cloud-V file
* Replaced Makefile with the original one
---------
Co-authored-by: moiz.hussain <moiz.hussain@10xengineers.ai>

* [Docker] fix tools.sh argument passing.
This should allow passing multiple arguments to containers based on the
full image that use the tools.sh frontend.
Fix from https://github.com/ggerganov/llama.cpp/issues/2535#issuecomment-1697091734
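The fix is the usual shell-quoting one: forward the remaining arguments with a quoted `"$@"`. A minimal sketch of the pattern, assuming a tools.sh-style dispatcher (the layout and command names here are illustrative, not the exact script):

```sh
#!/bin/bash
# Illustrative tools.sh-style entrypoint: dispatch on the first
# argument, then forward the rest to the chosen binary.
arg1="$1"
shift

if [[ "$arg1" == '--run' || "$arg1" == '-r' ]]; then
    # Quoted "$@" passes every remaining argument through unchanged;
    # an unquoted $@ would re-split any argument containing spaces.
    ./main "$@"
else
    echo "Unknown command: $arg1" >&2
    exit 1
fi
```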

* Corrections and systemd units
* Missing dependency: clblast

* Use hipBLAS based on cuBLAS
* Update Makefile for the CUDA kernels
* Expand arch list and make it overrideable (see the build sketch after this entry)
* Fix multi GPU on multiple AMD architectures with rocblas_initialize() (#5)
* Add hipBLAS to README
* New build arg LLAMA_CUDA_MMQ_Y
* Fix half2 decomposition
* Add intrinsics polyfills for AMD
* AMD assembly optimized __dp4a
* Allow overriding CC_TURING
* Use "ROCm" instead of "CUDA"
* Ignore all build dirs
* Add Dockerfiles
* Fix llama-bench
* Fix -nommq help for non-CUDA/HIP builds
---------
Co-authored-by: YellowRoseCx <80486540+YellowRoseCx@users.noreply.github.com>
Co-authored-by: ardfork <134447697+ardfork@users.noreply.github.com>
Co-authored-by: funnbot <22226942+funnbot@users.noreply.github.com>
Co-authored-by: Engininja2 <139037756+Engininja2@users.noreply.github.com>
Co-authored-by: Kerfuffle <44031344+KerfuffleV2@users.noreply.github.com>
Co-authored-by: jammm <2500920+jammm@users.noreply.github.com>
Co-authored-by: jdecourval <7315817+jdecourval@users.noreply.github.com>
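For the overrideable arch list, the build can presumably be pointed at a specific AMD target at make time. A sketch only, assuming the Makefile exposes the ROCm target list through a `GPU_TARGETS`-style variable (the variable name is an assumption):

```sh
# Build the hipBLAS/ROCm backend for one specific AMD architecture
# instead of the default (expanded) arch list.
make clean
make LLAMA_HIPBLAS=1 GPU_TARGETS=gfx1030 -j"$(nproc)"
```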

* Create llama-cpp.srpm
* Rename llama-cpp.srpm to llama-cpp.srpm.spec
  Correcting the extension.
* Tested spec builds successfully.
* Update llama-cpp.srpm.spec
* Create lamma-cpp-cublas.srpm.spec
* Create lamma-cpp-clblast.srpm.spec
* Update lamma-cpp-cublas.srpm.spec
  Added BuildRequires.
* Moved to the devops dir
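Building from one of these specs should follow the standard RPM flow; a sketch, assuming `rpmbuild` is available and the spec's Source0 and BuildRequires can be satisfied:

```sh
# Build binary and source RPMs from the CPU spec.
rpmbuild -ba llama-cpp.srpm.spec

# Source RPM only, e.g. for handing to a build system:
rpmbuild -bs llama-cpp.srpm.spec
```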

This prevents accidentally expanding arguments that contain spaces (see the demonstration below).
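A generic shell demonstration of the difference between quoted and unquoted expansion (not code from the repo):

```sh
set -- "a b" c        # two positional parameters: "a b" and "c"

printf '<%s>\n' $@    # unquoted: re-split into three words <a> <b> <c>
printf '<%s>\n' "$@"  # quoted: preserved as two words <a b> <c>
```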

Co-authored-by: canardleteer <eris.has.a.dad+github@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* Modify Dockerfile default character set to improve compatibility (#1673)

A deprecation disclaimer was added to convert-pth-to-ggml.py.

Git added to the build packages, for version information in the Docker image.
Signed-off-by: Jiri Podivin <jpodivin@gmail.com>

The quantization type can now be passed by name instead of `int` (while the
`int` option is still supported).
This allows the following usage:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin q4_0`
instead of:
`./quantize ggml-model-f16.bin ggml-model-q4_0.bin 2`

After #545 we no longer need torch, tqdm and requests in the dependencies.

By using `pip install torch --index-url https://download.pytorch.org/whl/cpu`
instead of `pip install torch`, we can specify that we want the CPU-only build
of PyTorch, without any GPU dependencies. This reduces the size of the Docker
image from 7.32 GB to 1.62 GB.
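A quick way to verify that the CPU-only wheel landed in the image (a generic check, not part of the commit): CPU builds report a `+cpu` version suffix and no CUDA support.

```sh
python -c 'import torch; print(torch.__version__, torch.cuda.is_available())'
# expected output is something like: 2.0.1+cpu False
```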

* Add tqdm to Python requirements
* Remove torchvision and torchaudio, add requests

The README tells people to use the command line option "-t 8", causing 8
threads to be started. On systems with fewer than 8 cores, this causes a
significant slowdown. Remove the option from the example command lines
and use /proc/cpuinfo on Linux to determine a sensible default.
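From the shell, a sensible default can be derived the same way; a sketch (the commit does this inside the program, and the model path and flags below are illustrative):

```sh
# Count logical processors from /proc/cpuinfo (Linux);
# `nproc` from coreutils is an equivalent shortcut.
threads=$(grep -c '^processor' /proc/cpuinfo)
./main -m ./models/7B/ggml-model-q4_0.bin -t "$threads" -p "Hello"
```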

* feat: dockerize llamacpp
* feat: split build & runtime stages
* split dockerfile into main & tools
* add quantize into the tools docker image
* Update .devops/tools.sh
  Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* add docker action pipeline
* change CI to publish to the GitHub docker registry
* fix runs-on name: macOS-latest should be macos-latest (lowercase)
* include docker versioned images
* fix github action docker
* fix docker.yml
* feat: include all-in-one command tool & update readme.md (usage sketch below)
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
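Illustrative usage of the resulting images; the registry path and tags follow the GitHub container registry convention mentioned above, but treat the exact names as assumptions:

```sh
# All-in-one flow in the "full" image: download, convert, quantize.
docker run -v /path/to/models:/models \
  ghcr.io/ggerganov/llama.cpp:full --all-in-one "/models/" 7B

# Run inference with the lighter runtime image.
docker run -v /path/to/models:/models \
  ghcr.io/ggerganov/llama.cpp:light \
  -m /models/7B/ggml-model-q4_0.bin -p "Hello" -n 64
```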