🤗 Transformers provides a different model head for each task, as long as the model supports that task (for example, you can't use DistilBERT for a sequence-to-sequence task like translation).