add imatrix origin
Browse files
README.md
CHANGED
@@ -23,7 +23,7 @@ An example of a complex debugging scenario is where you build library A on top o
|
|
23 |
|
24 |

|
25 |
|
26 |
-
IQ here refers to Importance Matrix Quantization. For performance comparison against regular GGUF, please read [this Reddit post](https://www.reddit.com/r/LocalLLaMA/comments/1993iro/ggufs_quants_can_punch_above_their_weights_now/).
|
27 |
|
28 |
<br>
|
29 |
|
|
|
23 |
|
24 |

|
25 |
|
26 |
+
IQ here refers to Importance Matrix Quantization. For performance comparison against regular GGUF, please read [this Reddit post](https://www.reddit.com/r/LocalLLaMA/comments/1993iro/ggufs_quants_can_punch_above_their_weights_now/). For more info on the techique, please see [this GitHub discussion](https://github.com/ggerganov/llama.cpp/discussions/5006/).
|
27 |
|
28 |
<br>
|
29 |
|