author | Georgi Gerganov <ggerganov@gmail.com> | 2024-01-17 18:46:30 +0200 |
---|---|---|
committer | GitHub <noreply@github.com> | 2024-01-17 18:46:30 +0200 |
commit | ba69bbc84ced580fe4fdb0713ca2d95634325b7a | |
tree | 75628460a0036db7ca1105bf3093f76c71ab3eed /ggml.h | |
parent | 44a1a4a41a4c0b03afaa7d9e06bcbc7cf95aa1e6 |
imatrix : offload to GPU support (#4957)
* backend : add eval callback
ggml-ci
* backend : group nodes in a single compute when the user doesn't need them
* backend : clean-up the implementation
ggml-ci
* simple : do not perform tensor data copy if not needed
* simple : fix
* imatrix : offload to GPU support
* imatrix : fix ggml_mul_mat_id handling
ggml-ci
* ci : add imatrix test
ggml-ci
* ci : rearrange output
ggml-ci
Diffstat (limited to 'ggml.h')
-rw-r--r-- | ggml.h | 6 |
1 files changed, 0 insertions, 6 deletions
@@ -2086,12 +2086,6 @@ extern "C" {
     GGML_API void ggml_deinit_iq2_quantization(enum ggml_type type);
 
     //
-    // Importance matrix
-    //
-    typedef void(*ggml_collect_imatrix_t)(const struct ggml_tensor * src0, const struct ggml_tensor * src1);
-    GGML_API void ggml_set_imatrix_collection(ggml_collect_imatrix_t imatrix_collect);
-
-    //
     // gguf
     //
 
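The two declarations removed from `ggml.h` form a global callback hook: a function-pointer type that receives the two operands of an operation (`src0`, `src1`) and a setter that installs it. The sketch below illustrates that registration pattern only. It is not ggml's implementation: `demo_tensor`, `demo_set_collection`, `demo_mul_mat`, and `demo_count_collector` are hypothetical stand-ins, and the real `struct ggml_tensor` carries far more state than the two fields assumed here.

```c
#include <stddef.h>

/* Hypothetical stand-in for struct ggml_tensor: just a name and an element count. */
struct demo_tensor {
    const char * name;
    size_t       n_elements;
};

/* Same shape as the removed ggml_collect_imatrix_t typedef. */
typedef void (*demo_collect_t)(const struct demo_tensor * src0,
                               const struct demo_tensor * src1);

/* Currently installed collector; NULL means collection is disabled. */
static demo_collect_t g_collect = NULL;

/* Analogue of the removed ggml_set_imatrix_collection(): install or clear the hook. */
void demo_set_collection(demo_collect_t collect) {
    g_collect = collect;
}

/* Stand-in for a matrix-multiply op: calls the hook, if one is installed. */
void demo_mul_mat(const struct demo_tensor * src0, const struct demo_tensor * src1) {
    if (g_collect != NULL) {
        g_collect(src0, src1);
    }
    /* ... the actual multiplication would happen here ... */
}

/* Example collector: counts invocations and sums the element counts of src1. */
static int    g_calls = 0;
static size_t g_seen  = 0;

void demo_count_collector(const struct demo_tensor * src0, const struct demo_tensor * src1) {
    (void) src0; /* unused in this sketch */
    g_calls += 1;
    g_seen  += src1->n_elements;
}
```

Calling `demo_set_collection(demo_count_collector)` before running `demo_mul_mat` makes every subsequent call report its operands to the collector. A hook of this kind only fires where the op code itself invokes it, which may be why this commit drops it from the header while adding a backend eval callback, as the commit message describes.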