index
:
ik_llama.cpp.git
main
Unnamed repository; edit this file 'description' to name the repository.
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
examples
/
quantize-stats
/
quantize-stats.cpp
Age
Commit message (
Expand
)
Author
2025-07-14
Adding IQ2_KL (#602)
Kawrakow
2025-05-23
Fix MSVC compilation (#448)
Kawrakow
2025-05-23
Fix typo in non-AVX2 code branch (#445)
Kawrakow
2025-05-23
Trellis quants with CPU inference (#441)
Andrew Chan
2025-04-07
Add copyright notices (#317)
Kawrakow
2024-10-16
Adding IQ4_KSS: 4.0 bpw quants (#89)
Kawrakow
2024-10-13
IQ2_KS: 2.1875 bpw non-linear quantization (#85)
Kawrakow
2024-10-10
Better model info (#84)
Kawrakow
2024-08-19
quantize_stats: print rmse and max error as fraction of <x> (#21)
Kawrakow
2024-07-27
Merge mainline llama.cpp (#3)
Kawrakow
2024-06-22
bitnet: add 2 bpw quantization
Iwan Kawrakow
2024-04-30
Improve usability of --model-url & related flags (#6930)
Olivier Chafik
2024-02-03
refactor : switch to emplace_back to avoid extra object (#5291)
Michael Klimenko
2024-01-30
SOTA 3-bit quants (#5196)
Kawrakow
2023-12-07
llama : per-layer KV cache + quantum K cache (#4309)
Georgi Gerganov
2023-11-02
build : link against build info instead of compiling against it (#3879)
cebtenzzre
2023-09-28
llama.cpp : split llama_context_params into model and context params (#3301)
slaren
2023-09-18
make : restore build-info.h dependency for several targets (#3205)
Cebtenzzre
2023-09-15
examples : add compiler version and target to build info (#2998)
Cebtenzzre
2023-09-15
check C++ code with -Wmissing-declarations (#3184)
Cebtenzzre
2023-09-07
fix some warnings from gcc and clang-tidy (#3038)
Cebtenzzre
2023-08-21
gguf : new file format with flexible meta data (beta) (#2398)
Georgi Gerganov
2023-07-05
ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)
Stephan Walter
2023-06-24
llama : make model stateless and context stateful (llama_state) (#1797)
Didzis Gosko
2023-06-16
build : fix and ignore MSVC warnings (#1889)
Borislav Stanimirov
2023-06-05
ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
Kawrakow
2023-05-17
Remove unused n_parts parameter (#1509)
Stephan Walter
2023-05-01
Add git-based build information for better issue tracking (#1232)
DannyDaemonic
2023-04-20
llama : multi-threaded quantization (#1075)
Kawrakow
2023-04-17
quantize-stats : fix bug in --type argument
Georgi Gerganov
2023-04-14
Expose type name from ggml (#970)
Pavol Rusnak
2023-04-13
llama : merge llama_internal.h into llama.h
Georgi Gerganov
2023-04-10
Rewrite loading code to try to satisfy everyone:
comex
2023-04-08
Add quantize-stats command for testing quantization (#728)
unbounded