A good starting point is to look at an existing quantization method, such as quantizer_awq.py.

Write the _process_model_after_weight_loading method.
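
As a rough illustration of where this method sits, here is a minimal, library-free sketch. It assumes the base quantizer exposes a public post-processing entry point that delegates to `_process_model_after_weight_loading` once the weights are loaded. `SketchQuantizerBase`, `DummyModel`, `MyQuantizer`, and the `is_quantized` flag are hypothetical stand-ins for illustration only, not the library's actual classes; the real base class and the steps quantizer_awq.py performs should be checked in the source.

```python
from abc import ABC, abstractmethod


class SketchQuantizerBase(ABC):
    """Hypothetical stand-in for the library's quantizer base class."""

    def postprocess_model(self, model, **kwargs):
        # Assumed hook order: this runs once the weights have been loaded,
        # and simply delegates to the private method below.
        return self._process_model_after_weight_loading(model, **kwargs)

    @abstractmethod
    def _process_model_after_weight_loading(self, model, **kwargs):
        """Subclasses implement their post-load steps here."""


class DummyModel:
    """Hypothetical model object, used only to make the sketch runnable."""

    def __init__(self):
        self.is_quantized = False


class MyQuantizer(SketchQuantizerBase):
    """Hypothetical quantizer showing the shape of the method to write."""

    def _process_model_after_weight_loading(self, model, **kwargs):
        # Post-load work goes here, e.g. finalizing packed weights or
        # swapping in optimized kernels. This sketch only sets a flag.
        model.is_quantized = True
        return model


model = MyQuantizer().postprocess_model(DummyModel())
```

Returning the model from the method keeps the call chainable; whether the real method modifies the model in place, returns it, or both is something to confirm against quantizer_awq.py.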