Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
In this case, we prefer to only support inference in Transformers and let the third-party library maintained by the ML community deal with the model quantization itself.