path: root/examples/quantize-stats/quantize-stats.cpp
Age | Commit message | Author
2025-07-14 | Adding IQ2_KL (#602) | Kawrakow
2025-05-23 | Fix MSVC compilation (#448) | Kawrakow
2025-05-23 | Fix typo in non-AVX2 code branch (#445) | Kawrakow
2025-05-23 | Trellis quants with CPU inference (#441) | Andrew Chan
2025-04-07 | Add copyright notices (#317) | Kawrakow
2024-10-16 | Adding IQ4_KSS: 4.0 bpw quants (#89) | Kawrakow
2024-10-13 | IQ2_KS: 2.1875 bpw non-linear quantization (#85) | Kawrakow
2024-10-10 | Better model info (#84) | Kawrakow
2024-08-19 | quantize_stats: print rmse and max error as fraction of <x> (#21) | Kawrakow
2024-07-27 | Merge mainline llama.cpp (#3) | Kawrakow
2024-06-22 | bitnet: add 2 bpw quantization | Iwan Kawrakow
2024-04-30 | Improve usability of --model-url & related flags (#6930) | Olivier Chafik
2024-02-03 | refactor : switch to emplace_back to avoid extra object (#5291) | Michael Klimenko
2024-01-30 | SOTA 3-bit quants (#5196) | Kawrakow
2023-12-07 | llama : per-layer KV cache + quantum K cache (#4309) | Georgi Gerganov
2023-11-02 | build : link against build info instead of compiling against it (#3879) | cebtenzzre
2023-09-28 | llama.cpp : split llama_context_params into model and context params (#3301) | slaren
2023-09-18 | make : restore build-info.h dependency for several targets (#3205) | Cebtenzzre
2023-09-15 | examples : add compiler version and target to build info (#2998) | Cebtenzzre
2023-09-15 | check C++ code with -Wmissing-declarations (#3184) | Cebtenzzre
2023-09-07 | fix some warnings from gcc and clang-tidy (#3038) | Cebtenzzre
2023-08-21 | gguf : new file format with flexible meta data (beta) (#2398) | Georgi Gerganov
2023-07-05 | ggml : generalize `quantize_fns` for simpler FP16 handling (#1237) | Stephan Walter
2023-06-24 | llama : make model stateless and context stateful (llama_state) (#1797) | Didzis Gosko
2023-06-16 | build : fix and ignore MSVC warnings (#1889) | Borislav Stanimirov
2023-06-05 | ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684) | Kawrakow
2023-05-17 | Remove unused n_parts parameter (#1509) | Stephan Walter
2023-05-01 | Add git-based build information for better issue tracking (#1232) | DannyDaemonic
2023-04-20 | llama : multi-threaded quantization (#1075) | Kawrakow
2023-04-17 | quantize-stats : fix bug in --type argument | Georgi Gerganov
2023-04-14 | Expose type name from ggml (#970) | Pavol Rusnak
2023-04-13 | llama : merge llama_internal.h into llama.h | Georgi Gerganov
2023-04-10 | Rewrite loading code to try to satisfy everyone: | comex
2023-04-08 | Add quantize-stats command for testing quantization (#728) | unbounded