This means the _process_model_before_weight_loading method rewrites the model skeleton before any weights are loaded, replacing some modules (e.g., nn.Linear) with the target modules (quantization modules).
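As a rough illustration of this kind of module swap, the sketch below walks a model tree and replaces linear layers in place. It is not the library's implementation: Module and Linear are minimal stand-ins for torch.nn.Module and nn.Linear so the example runs without dependencies, and QuantLinear, replace_linear, and the skip parameter are hypothetical names introduced only for this example.

```python
class Module:
    """Minimal stand-in for torch.nn.Module (assumption: keeps the sketch dependency-free)."""

    def named_children(self):
        # Build the list up front so callers can safely reassign attributes while looping.
        return [(name, value) for name, value in vars(self).items()
                if isinstance(value, Module)]


class Linear(Module):
    """Stand-in for nn.Linear; only the shape matters on a weightless skeleton."""

    def __init__(self, in_features, out_features):
        self.in_features = in_features
        self.out_features = out_features


class QuantLinear(Module):
    """Hypothetical quantization module that will receive the quantized weights later."""

    def __init__(self, in_features, out_features, bits=4):
        self.in_features = in_features
        self.out_features = out_features
        self.bits = bits


def replace_linear(model, bits=4, skip=()):
    """Recursively swap Linear children for QuantLinear; return how many were replaced."""
    replaced = 0
    for name, child in model.named_children():
        if isinstance(child, Linear) and name not in skip:
            setattr(model, name, QuantLinear(child.in_features, child.out_features, bits))
            replaced += 1
        else:
            # Descend into containers (and skipped layers, which have no children).
            replaced += replace_linear(child, bits, skip)
    return replaced


class Block(Module):
    def __init__(self):
        self.proj = Linear(8, 16)
        self.lm_head = Linear(16, 4)


class TinyModel(Module):
    def __init__(self):
        self.block = Block()
        self.out = Linear(4, 2)


model = TinyModel()
count = replace_linear(model, skip=("lm_head",))
print(count, type(model.block.proj).__name__, type(model.block.lm_head).__name__)
```

Because the swap happens on the skeleton, only layer shapes are needed at this stage; the skip argument shows one way a layer such as an output head could be left unquantized.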