Update README.md
Browse files
README.md
CHANGED
@@ -8,12 +8,16 @@ tags:
|
|
8 |
license: apache-2.0
|
9 |
---
|
10 |
|
11 |
-
|
12 |
|
13 |
## Model Details
|
14 |
|
15 |
This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 3-bit. The model has been created, tested, and evaluated by The Kaitchup.
|
16 |
|
|
|
|
|
|
|
|
|
17 |
Details on the quantization process and how to use the model here: [The Kaitchup](https://kaitchup.substack.com/)
|
18 |
|
19 |
- **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)
|
|
|
8 |
license: apache-2.0
|
9 |
---
|
10 |
|
11 |
+
*Note: vLLM has issues running 3-bit models quantized with AutoRound. The model works fine with Transformers.*
|
12 |
|
13 |
## Model Details
|
14 |
|
15 |
This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 3-bit. The model has been created, tested, and evaluated by The Kaitchup.
|
16 |
|
17 |
+
|
18 |
+

|
19 |
+
|
20 |
+
|
21 |
Details on the quantization process and how to use the model here: [The Kaitchup](https://kaitchup.substack.com/)
|
22 |
|
23 |
- **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)
|