File size: 1,052 Bytes

db4964e
7bbb100
 
c4ce1a9
 
7bbb100
c4ce1a9
7bbb100
c4ce1a9
db4964e
89c3ab5
db4964e
 
 
7bbb100
db4964e
89c3ab5
 
 
 
7bbb100
db4964e
7bbb100
 
 
db4964e
7bbb100

---
language:
- en
library_name: transformers
tags:
- auto-gptq
- AutoRound
license: apache-2.0
---

*Note: vLLM has issues running 3-bit models quantized with AutoRound. The model works fine with Transformers.*

## Model Details

This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) quantized with [AutoRound](https://github.com/intel/auto-round/tree/main) (symmetric quantization) and serialized with the GPTQ format in 3-bit. The model has been created, tested, and evaluated by The Kaitchup.


![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b93e6bd6c468ac7536607e/zyIZlKq6mvBvKsKEFDrEm.png)


Details on the quantization process and how to use the model here: [The Kaitchup](https://kaitchup.substack.com/)

- **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)
- **Language(s) (NLP):** English
- **License:** Apache 2.0 license

## How to Support My Work
Subscribe to [The Kaitchup](https://kaitchup.substack.com/subscribe)! I release quantized models with the Apache 2.0 license.