Ahmadzei's picture
update 1
57bdca5
raw
history blame contribute delete
160 Bytes
The best starting point would be to have a look at another quantization methods such as quantizer_awq.py.
Write the _process_model_after_weight_loading method.