| Age | Commit message (Expand) | Author |
| 2025-06-22 | Abort if IQK_IMPLEMENT is not defined | Iwan Kawrakow |
| 2025-06-21 | Faster ARM_NEON GEMM implementation for legacy quants (#546) | Kawrakow |
| 2025-06-21 | Perhaps slightly faster trellis quants (#541) | Kawrakow |
| 2025-06-20 | New integer trellis on ARM_NEON (#544) | Kawrakow |
| 2025-06-19 | Fix NEON build (#542) | Kawrakow |
| 2025-06-19 | Update CMakeLists.txt to fix NDEBUG handling (#537) | Anton Sokolchenko |
| 2025-06-19 | Fix missed block_q8_x2 bf16 -> i16 change (#540) | Kawrakow |
| 2025-06-18 | Fix KT Neon / ARM typo (#536) | Louie Helm |
| 2025-06-18 | Fix MSVC compilation error | Iwan Kawrakow |
| 2025-06-18 | New IQ2_KT, IQ3_KT and IQ4_KT, V2 (#529) | Kawrakow |
| 2025-06-18 | Much faster CPU prompt processing (part 3) (#534) | Kawrakow |
| 2025-06-18 | Much faster CPU prompt processing (part 2) (#533) | Kawrakow |
| 2025-06-17 | Much faster CPU prompt processing (part 1) (#531) | Kawrakow |
| 2025-06-14 | Call iqk_convert_repack in MoE GEMM (#528) | Kawrakow |
| 2025-06-13 | Faster CPU prompt processing for Q4_K and Q5_K (#525) | Kawrakow |
| 2025-06-13 | Perhaps a slightly better version for IQ2_XXS, IQ3_XXS, IQ3_S GEMV (#524) | Kawrakow |
| 2025-06-12 | Better strategy for GPU offload (#520) | Kawrakow |
| 2025-06-12 | iq3_s: much faster GEMM via repacking to q8_0_r8 (#518) | Kawrakow |
| 2025-06-11 | Faster iq1_s GEMM via repacking to Q8_0_R8 (#517) | Kawrakow |
| 2025-06-11 | Much faster iq3_xxs GEMM via repacking to q8_0_r8 (AVX2) (#516) | Kawrakow |
| 2025-06-11 | IQ2_XXS: much faster CPU prompt processing (#515) | Kawrakow |
| 2025-06-10 | Fix Compile error (C2668) (#508) | Gaolingx |
| 2025-06-08 | Fix non rpc build error (#506) | firecoperana |
| 2025-06-08 | Revert "Rpc improvement (#480)" | Iwan Kawrakow |
| 2025-06-08 | Rpc improvement (#480) | firecoperana |
| 2025-06-07 | Fix #499 (#501) | Kawrakow |
| 2025-06-05 | IQ1_M_R4 CUDA implementation (#494) | Kawrakow |
| 2025-06-05 | MMQ implementation for IQ4_KS_R4 and IQ5_KS_R4 (#493) | Kawrakow |
| 2025-06-05 | Faster CPU prompt processing for Trellis quants and MoE models (#488) | Kawrakow |
| 2025-06-05 | CUDA implementation for IQ1_S_R4 (#492) | Kawrakow |
| 2025-06-01 | Minor (~2%) iq2_ks TG performance improvement on CUDA (#468) | Kawrakow |
| 2025-06-01 | Trellis quants: faster CPU prompt processing (#482) | Kawrakow |
| 2025-06-01 | Metal implementatio for the trellis quants. (#475) | Kawrakow |
| 2025-05-29 | NEON implementation for trellis quants (#471) | Kawrakow |
| 2025-05-27 | CUDA GEMM and GEMV for IQ4_KS_R4 and IQ5_KS_R4 (#462) | Kawrakow |
| 2025-05-26 | CUDA implementation for IQ2_K_R4, IQ3_K_R4, IQ4_K_R4, IQ5_K_R4 (#461) | Kawrakow |
| 2025-05-24 | Legacy quants conversion schemes in convert_hf_to_gguf.py (#449) | Nexes the Elder |
| 2025-05-24 | Faster IQ3_KT and IQ4_KT (#453) | Kawrakow |
| 2025-05-23 | Fix bug in MMVQ kernel (#446) | Kawrakow |
| 2025-05-23 | Fix MSVC compilation (#448) | Kawrakow |
| 2025-05-23 | Trellis quants with CPU inference (#441) | Andrew Chan |
| 2025-05-22 | Refactor iqk_mul_mat.cpp (#435) | Kawrakow |
| 2025-05-20 | Bug fixes from mainline (#439) | Kawrakow |
| 2025-05-18 | Forgotten MMQ ref and typo (#431) | Nexes the Elder |
| 2025-05-17 | Option to enable disable the IQK CPU FA kernels (#429) | Kawrakow |
| 2025-05-17 | Zen4: Faster PP for IQ2_KS, IQ4_KS, IQ5_KS (#428) | Kawrakow |
| 2025-05-17 | IQ5_KS_R4: row-interleaved IQ5_KS (#426) | Kawrakow |
| 2025-05-16 | Fix AVX2 implementation of IQ4_K, IQ4_KS, IQ5_K, IQ6_K (#427) | Kawrakow |
| 2025-05-15 | Adding forgotten template instance for iq5_ks (#424) | Kawrakow |
| 2025-05-15 | Adding IQ5_KS - 5.25 bpw quants (#422) | Kawrakow |