bnjmnmarie commited on
Commit
89c3ab5
·
verified ·
1 Parent(s): 7bbb100

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -8,12 +8,16 @@ tags:
8
  license: apache-2.0
9
  ---
10
 
11
-
12
 
13
  ## Model Details
14
 
15
  This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 3-bit. The model has been created, tested, and evaluated by The Kaitchup.
16
 
 
 
 
 
17
  Details on the quantization process and how to use the model here: [The Kaitchup](https://kaitchup.substack.com/)
18
 
19
  - **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)
 
8
  license: apache-2.0
9
  ---
10
 
11
+ *Note: vLLM has issues running 3-bit models quantized with AutoRound. The model works fine with Transformers.*
12
 
13
  ## Model Details
14
 
15
  This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 3-bit. The model has been created, tested, and evaluated by The Kaitchup.
16
 
17
+
18
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b93e6bd6c468ac7536607e/zyIZlKq6mvBvKsKEFDrEm.png)
19
+
20
+
21
  Details on the quantization process and how to use the model here: [The Kaitchup](https://kaitchup.substack.com/)
22
 
23
  - **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)