ik_llama.cpp.git (branch: main)
path: root/examples/imatrix/imatrix.cpp
| Age | Commit message | Author |
|---|---|---|
| 2025-05-13 | Fix imatrix calculation for MLA models (#411) | Kawrakow |
| 2025-04-14 | imatrix: collect layer influence statistics (#328) | Kawrakow |
| 2025-03-10 | DeepSeek imatrix stuff (#250) | Kawrakow |
| 2025-02-12 | Fix imatrix overprotectiveness (#202) | Kawrakow |
| 2024-08-12 | Merge mainline - Aug 12 2024 (#17) | Kawrakow |
| 2024-07-24 | Add copyright notices | Iwan Kawrakow |
| 2024-06-26 | imatrix: be able to specify the name of the output tensor | Iwan Kawrakow |
| 2024-06-09 | imatrix : handle partial entries (#7833) | Georgi Gerganov |
| 2024-06-07 | check for nans in imatrix and quantize (#7807) | slaren |
| 2024-06-06 | imatrix : migrate to gpt_params (#7771) | Georgi Gerganov |
| 2024-06-04 | common : refactor cli arg parsing (#7675) | Georgi Gerganov |
| 2024-05-22 | common : normalize naming style (#7462) | Georgi Gerganov |
| 2024-05-08 | Fixed save_imatrix to match old behaviour for MoE (#7099) | jukofyork |
| 2024-04-26 | quantize: add imatrix and dataset metadata in GGUF (#6658) | Pierrick Hymbert |
| 2024-04-18 | ggml : group all experts in a single ggml_mul_mat_id (#6505) | slaren |
| 2024-04-12 | imatrix : remove invalid assert (#6632) | Georgi Gerganov |
| 2024-04-11 | eval-callback: Example how to use eval callback for debugging (#6576) | Pierrick Hymbert |
| 2024-04-09 | BERT tokenizer fixes (#6498) | Jared Van Bortel |
| 2024-04-03 | ggml : mul_mat_id use the same tensor for all the experts (#6387) | slaren |
| 2024-03-26 | llama : greatly reduce output buffer memory usage (#6122) | compilade |
| 2024-03-24 | imatrix : fix wname for mul_mat_id ops (#6271) | Georgi Gerganov |
| 2024-03-18 | backend : offload large batches to GPU (#6083) | slaren |
| 2024-02-16 | ggml : add numa options (#5377) | bmwl |
| 2024-02-04 | Adding some imatrix tools (#5302) | Kawrakow |
| 2024-01-22 | imatrix : keep intermediate imatrix results (#5077) | Kawrakow |
| 2024-01-21 | Slightly faster imatrix (#5050) | Kawrakow |
| 2024-01-18 | imatrix : fix assert for src0 non-cont check | Georgi Gerganov |
| 2024-01-17 | imatrix : offload to GPU support (#4957) | Georgi Gerganov |
| 2024-01-12 | Importance Matrix calculation (#4861) | Kawrakow |