- Neural Machine Translation of Rare Words with Subword Units
  Paper • arXiv:1508.07909
- Attention Is All You Need
  Paper • arXiv:1706.03762
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • arXiv:1810.04805
- Generating Wikipedia by Summarizing Long Sequences
  Paper • arXiv:1801.10198
Collections including paper arxiv:1810.04805