Some good examples of this field are:
BERT Rediscovers the Classical NLP Pipeline by Ian Tenney, Dipanjan Das, Ellie Pavlick:
https://arxiv.org/abs/1905.05950
Are Sixteen Heads Really Better than One?