ik_llama.cpp.git

Age	Commit message	Author
2023-11-13	ggml : sync (im2col, GPU conv, 32-bit arm compat) (#4060)	Georgi Gerganov
2023-11-13	sync : ggml (backend v2) (#3912)	Georgi Gerganov
2023-11-07	ggml : fix backward rope after YaRN (#3974)	xaedes
2023-11-02	gguf : print error for GGUFv1 files (#3908)	Georgi Gerganov
2023-11-02	gguf : remove special-case code for GGUFv1 (#3901)	Georgi Gerganov
2023-11-01	llama : implement YaRN RoPE scaling (#2268)	cebtenzzre
2023-11-01	finetune : add -ngl parameter (#3762)	Andrew Godfrey
2023-10-30	ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861)	Georgi Gerganov
2023-10-29	ggml : quantization refactoring (#3833)	Georgi Gerganov
2023-10-24	sync : ggml (conv ops + cuda MSVC fixes) (#3765)	Georgi Gerganov
2023-10-24	cuda : add batched cuBLAS GEMM for faster attention (#3749)	Georgi Gerganov
2023-10-20	gguf : support big endian platform (#3552)	Qin Yue Chen
2023-10-20	ggml : fix rope + llama minor optimizations (#3560)	Herman Semenov
2023-10-13	ggml : add context enumeration functions (#3605)	slaren
2023-10-12	examples: support LLaVA v1.5 (multimodal model) (#3436)	M. Yusuf Sarıgöz
2023-10-10	llm : add MPT support (#3417)	Jan Ploski
2023-10-09	refact : fix convert script + zero out KV cache to avoid nans (#3523)	Georgi Gerganov
2023-10-08	sync : ggml (ggml-backend) (#3548)	Georgi Gerganov
2023-10-04	ggml : fix build after #3329	Georgi Gerganov
2023-10-04	llm : add Refact model (#3329)	ds5t5
2023-10-04	sync : ggml (conv 1d + 2d updates, UB fixes) (#3468)	Georgi Gerganov
2023-10-03	ggml : add RISC-V Vector Support for K-Quants and improved the existing intri...	Tameem
2023-10-02	CLBlast: Add broadcast support for matrix multiplication (#3402)	shibe2
2023-09-28	build : enable more non-default compiler warnings (#3200)	Cebtenzzre
2023-09-28	ggml : release the requested thread pool resource (#3292)	Qu Zongfu
2023-09-28	train : finetune LORA (#2632)	xaedes
2023-09-28	gguf : basic type checking in gguf_get_* (#3346)	Cebtenzzre
2023-09-28	llama : custom attention mask + parallel decoding + no context swaps (#3228)	Georgi Gerganov
2023-09-15	sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192)	Georgi Gerganov
2023-09-15	metal : relax conditions on fast matrix multiplication kernel (#3168)	Georgi Gerganov
2023-09-12	arm64 support for windows (#3007)	Eric Sommerlade
2023-09-08	sync : ggml (CUDA GLM RoPE + POSIX) (#3082)	Georgi Gerganov
2023-09-08	build : do not use _GNU_SOURCE gratuitously (#2035)	Przemysław Pawełczyk
2023-09-08	enable CPU HBM (#2603)	Kunshang Ji
2023-09-07	fix some warnings from gcc and clang-tidy (#3038)	Cebtenzzre
2023-09-07	ggml : posixify madvise and pagesize (#3037)	Przemysław Pawełczyk
2023-09-02	k-quants : fix build on armv7 (android only) (#2920)	Jhen-Jie Hong
2023-09-01	ggml : add RISC-V vector intrinsics support (#2929)	Tameem
2023-08-29	ggml : add view_src and view_offs to ggml_tensor for views (#2874)	slaren
2023-08-28	train : mem usage and other improvements (#2439)	xaedes
2023-08-28	ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852)	Georgi Gerganov
2023-08-28	metal : fix memory leak (#2762)	Georgi Gerganov
2023-08-27	gguf : fix strings to not be null-terminated (#2839)	Georgi Gerganov
2023-08-27	gguf : add 64-bit support (GGUF v2) (#2821)	Georgi Gerganov
2023-08-27	ggml : detect SSSE3 (#2825)	Przemysław Pawełczyk
2023-08-23	llm : add Falcon support (#2717)	Georgi Gerganov
2023-08-22	ggml : sync latest (SAM + SD operators, CUDA alibi) (#2709)	Georgi Gerganov
2023-08-21	gguf : new file format with flexible meta data (beta) (#2398)	Georgi Gerganov
2023-08-20	ggml : move all type info to ggml_type_traits (#2663)	slaren
2023-08-07	ggml : mul mat tweaks (#2372)	Georgi Gerganov