summaryrefslogtreecommitdiff
path: root/ggml.c
AgeCommit message (Expand)Author
2023-11-13ggml : sync (im2col, GPU conv, 32-bit arm compat) (#4060)Georgi Gerganov
2023-11-13sync : ggml (backend v2) (#3912)Georgi Gerganov
2023-11-07ggml : fix backward rope after YaRN (#3974)xaedes
2023-11-02gguf : print error for GGUFv1 files (#3908)Georgi Gerganov
2023-11-02gguf : remove special-case code for GGUFv1 (#3901)Georgi Gerganov
2023-11-01llama : implement YaRN RoPE scaling (#2268)cebtenzzre
2023-11-01finetune : add -ngl parameter (#3762)Andrew Godfrey
2023-10-30ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861)Georgi Gerganov
2023-10-29ggml : quantization refactoring (#3833)Georgi Gerganov
2023-10-24sync : ggml (conv ops + cuda MSVC fixes) (#3765)Georgi Gerganov
2023-10-24cuda : add batched cuBLAS GEMM for faster attention (#3749)Georgi Gerganov
2023-10-20gguf : support big endian platform (#3552)Qin Yue Chen
2023-10-20ggml : fix rope + llama minor optimizations (#3560)Herman Semenov
2023-10-13ggml : add context enumeration functions (#3605)slaren
2023-10-12examples: support LLaVA v1.5 (multimodal model) (#3436)M. Yusuf Sarıgöz
2023-10-10llm : add MPT support (#3417)Jan Ploski
2023-10-09refact : fix convert script + zero out KV cache to avoid nans (#3523)Georgi Gerganov
2023-10-08sync : ggml (ggml-backend) (#3548)Georgi Gerganov
2023-10-04ggml : fix build after #3329Georgi Gerganov
2023-10-04llm : add Refact model (#3329)ds5t5
2023-10-04sync : ggml (conv 1d + 2d updates, UB fixes) (#3468)Georgi Gerganov
2023-10-03ggml : add RISC-V Vector Support for K-Quants and improved the existing intri...Tameem
2023-10-02CLBlast: Add broadcast support for matrix multiplication (#3402)shibe2
2023-09-28build : enable more non-default compiler warnings (#3200)Cebtenzzre
2023-09-28ggml : release the requested thread pool resource (#3292)Qu Zongfu
2023-09-28train : finetune LORA (#2632)xaedes
2023-09-28gguf : basic type checking in gguf_get_* (#3346)Cebtenzzre
2023-09-28llama : custom attention mask + parallel decoding + no context swaps (#3228)Georgi Gerganov
2023-09-15sync : ggml (Metal F32 support + reduce ggml-alloc size) (#3192)Georgi Gerganov
2023-09-15metal : relax conditions on fast matrix multiplication kernel (#3168)Georgi Gerganov
2023-09-12arm64 support for windows (#3007)Eric Sommerlade
2023-09-08sync : ggml (CUDA GLM RoPE + POSIX) (#3082)Georgi Gerganov
2023-09-08build : do not use _GNU_SOURCE gratuitously (#2035)Przemysław Pawełczyk
2023-09-08enable CPU HBM (#2603)Kunshang Ji
2023-09-07fix some warnings from gcc and clang-tidy (#3038)Cebtenzzre
2023-09-07ggml : posixify madvise and pagesize (#3037)Przemysław Pawełczyk
2023-09-02k-quants : fix build on armv7 (android only) (#2920)Jhen-Jie Hong
2023-09-01ggml : add RISC-V vector intrinsics support (#2929)Tameem
2023-08-29ggml : add view_src and view_offs to ggml_tensor for views (#2874)slaren
2023-08-28train : mem usage and other improvements (#2439)xaedes
2023-08-28ggml : sync (mem align to header + conv_transpose_2d fixes + ggml_alloc) (#2852)Georgi Gerganov
2023-08-28metal : fix memory leak (#2762)Georgi Gerganov
2023-08-27gguf : fix strings to not be null-terminated (#2839)Georgi Gerganov
2023-08-27gguf : add 64-bit support (GGUF v2) (#2821)Georgi Gerganov
2023-08-27ggml : detect SSSE3 (#2825)Przemysław Pawełczyk
2023-08-23llm : add Falcon support (#2717)Georgi Gerganov
2023-08-22ggml : sync latest (SAM + SD operators, CUDA alibi) (#2709)Georgi Gerganov
2023-08-21gguf : new file format with flexible meta data (beta) (#2398)Georgi Gerganov
2023-08-20ggml : move all type info to ggml_type_traits (#2663)slaren
2023-08-07ggml : mul mat tweaks (#2372)Georgi Gerganov