FredZhang7 committed · Commit ec2367d · verified · 1 Parent(s): 6c863d3

add imatrix origin

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ An example of a complex debugging scenario is where you build library A on top o
 
  ![](./model_v0.1_throughput_comparison.png)
 
- IQ here refers to Importance Matrix Quantization. For performance comparison against regular GGUF, please read [this Reddit post](https://www.reddit.com/r/LocalLLaMA/comments/1993iro/ggufs_quants_can_punch_above_their_weights_now/).
+ IQ here refers to Importance Matrix Quantization. For performance comparison against regular GGUF, please read [this Reddit post](https://www.reddit.com/r/LocalLLaMA/comments/1993iro/ggufs_quants_can_punch_above_their_weights_now/). For more info on the technique, please see [this GitHub discussion](https://github.com/ggerganov/llama.cpp/discussions/5006/).
 
  <br>
 