imatrix : offload to GPU support (#4957)

* backend : add eval callback

ggml-ci

* backend : group nodes in a single compute when the user doesn't need them

* backend : clean-up the implementation

ggml-ci

* simple : do not perform tensor data copy if not needed

* simple : fix

* imatrix : offload to GPU support

* imatrix : fix ggml_mul_mat_id handling

ggml-ci

* ci : add imatrix test

ggml-ci

* ci : rearrange output

ggml-ci
author: Georgi Gerganov <ggerganov@gmail.com> 2024-01-17 18:46:30 +0200
committer: GitHub <noreply@github.com> 2024-01-17 18:46:30 +0200
commit: ba69bbc84ced580fe4fdb0713ca2d95634325b7a (patch)
tree: 75628460a0036db7ca1105bf3093f76c71ab3eed /ggml.h
parent: 44a1a4a41a4c0b03afaa7d9e06bcbc7cf95aa1e6 (diff)
1 files changed, 0 insertions, 6 deletions
diff --git a/ggml.h b/ggml.h
index 837c52e6..27daf6fd 100644
--- a/ggml.h
+++ b/ggml.h
@@ -2086,12 +2086,6 @@ extern "C" {
     GGML_API void ggml_deinit_iq2_quantization(enum ggml_type type);
 
     //
-    // Importance matrix
-    //
-    typedef void(*ggml_collect_imatrix_t)(const struct ggml_tensor * src0, const struct ggml_tensor * src1);
-    GGML_API void ggml_set_imatrix_collection(ggml_collect_imatrix_t imatrix_collect);
-
-    //
     // gguf
     //
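
Note on the removed declarations: the two deleted lines exposed a global hook that received the two operands of a matrix multiplication (src0, src1) so that a tool could gather importance-matrix statistics. The sketch below illustrates that registration pattern only. It does not use ggml: `mock_tensor`, `mock_set_imatrix_collection`, `mock_mul_mat`, and `collect_sumsq` are hypothetical stand-ins, and the accumulated statistic (per-column sum of squared activations) is an assumption about what a collector might compute. Per the commit message, this commit replaces the hook with a backend eval callback, which is not part of this diff and is not shown here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a tensor; NOT ggml's struct ggml_tensor. */
struct mock_tensor {
    const char  *name;
    int          ne0;   /* elements per row */
    int          ne1;   /* number of rows   */
    const float *data;  /* row-major values, may be NULL if unused */
};

/* Same shape as the removed typedef, but over the mock type. */
typedef void (*mock_collect_imatrix_t)(const struct mock_tensor *src0,
                                       const struct mock_tensor *src1);

/* Process-wide hook, mirroring the global setter that was removed. */
static mock_collect_imatrix_t g_collect = NULL;

void mock_set_imatrix_collection(mock_collect_imatrix_t cb) {
    g_collect = cb;
}

/* Hypothetical matmul entry point: fires the hook, computation omitted. */
void mock_mul_mat(const struct mock_tensor *src0,
                  const struct mock_tensor *src1) {
    if (g_collect != NULL) {
        g_collect(src0, src1);
    }
}

/* Example collector (assumption): sum of squared activations per column. */
#define N_COLS 3
static float g_sumsq[N_COLS];
static int   g_calls = 0;

void collect_sumsq(const struct mock_tensor *src0,
                   const struct mock_tensor *src1) {
    (void) src0;                     /* weights are not needed for this statistic */
    assert(src1->ne0 == N_COLS);
    for (int r = 0; r < src1->ne1; r++) {
        for (int c = 0; c < src1->ne0; c++) {
            const float x = src1->data[r * src1->ne0 + c];
            g_sumsq[c] += x * x;
        }
    }
    g_calls++;
}
```

A caller would register `collect_sumsq` once with `mock_set_imatrix_collection` and then run its matrix multiplications as usual. A global hook of this kind only observes operations that pass through the one code path invoking it, which is one plausible reason a per-node callback at the backend level fits GPU offload better.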