license: apache-2.0 | |
datasets: | |
- wmt/wmt14 | |
language: | |
- de | |
- en | |
pipeline_tag: text2text-generation | |
This is a huggingface port of the [PyTorch implementation of the original transformer](https://github.com/ubaada/scratch-transformer) model from 2017 introduced in the paper "[Attention Is All You Need](https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf)". This is the 65M parameter base model version trained to do English-to-German translations. |