summaryrefslogtreecommitdiff
path: root/examples/imatrix/imatrix.cpp
AgeCommit message (Expand)Author
2024-04-18ggml : group all experts in a single ggml_mul_mat_id (#6505)slaren
2024-04-12imatrix : remove invalid assert (#6632)Georgi Gerganov
2024-04-11eval-callback: Example how to use eval callback for debugging (#6576)Pierrick Hymbert
2024-04-09BERT tokenizer fixes (#6498)Jared Van Bortel
2024-04-03ggml : mul_mat_id use the same tensor for all the experts (#6387)slaren
2024-03-26llama : greatly reduce output buffer memory usage (#6122)compilade
2024-03-24imatrix : fix wname for mul_mat_id ops (#6271)Georgi Gerganov
2024-03-18backend : offload large batches to GPU (#6083)slaren
2024-02-16ggml : add numa options (#5377)bmwl
2024-02-04Adding some imatrix tools (#5302)Kawrakow
2024-01-22imatrix : keep intermediate imatrix results (#5077)Kawrakow
2024-01-21Slightly faster imatrix (#5050)Kawrakow
2024-01-18imatrix : fix assert for src0 non-cont checkGeorgi Gerganov
2024-01-17imatrix : offload to GPU support (#4957)Georgi Gerganov
2024-01-12Importance Matrix calculation (#4861)Kawrakow