summaryrefslogtreecommitdiff
path: root/examples/imatrix/README.md
diff options
context:
space:
mode:
Diffstat (limited to 'examples/imatrix/README.md')
-rw-r--r--examples/imatrix/README.md4
1 files changed, 2 insertions, 2 deletions
diff --git a/examples/imatrix/README.md b/examples/imatrix/README.md
index 38b36ee5..bb5faec9 100644
--- a/examples/imatrix/README.md
+++ b/examples/imatrix/README.md
@@ -1,6 +1,6 @@
# llama.cpp/examples/imatrix
-Compute an importance matrix for a model and given text dataset. Can be used during quantization to enchance the quality of the quantum models.
+Compute an importance matrix for a model and given text dataset. Can be used during quantization to enchance the quality of the quantized models.
More information is available here: https://github.com/ggerganov/llama.cpp/pull/4861
## Usage
@@ -25,7 +25,7 @@ For faster computation, make sure to use GPU offloading via the `-ngl` argument
## Example
```bash
-LLAMA_CUDA=1 make -j
+GGML_CUDA=1 make -j
# generate importance matrix (imatrix.dat)
./llama-imatrix -m ggml-model-f16.gguf -f train-data.txt -ngl 99