library_name: transformers
license: mit
datasets:
- domce20/c4-lithuanian-enhanced
- allenai/c4
language:
- lt
# Model Card for Model ID
GPT-2 based model trained for Lithuanian.
### Model Description
The model architecture is copied from the ai-forever/mGPT model, but the model is trained from scratch on a modified version of the Lithuanian partition of the mC4 dataset.
Training was done on the Vilnius University supercomputer.
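As a rough illustration of what "same architecture, trained from scratch" means in `transformers`, the sketch below builds a causal language model from a configuration alone, so the weights are randomly initialised rather than loaded from a checkpoint. This is not the authors' training code. The helper name `init_from_scratch` and the tiny config sizes are assumptions chosen so the example runs offline; for the real architecture one would presumably load the configuration with `AutoConfig.from_pretrained("ai-forever/mGPT")` instead.

```python
from transformers import AutoModelForCausalLM, GPT2Config


def init_from_scratch(config):
    """Build a causal LM with the given architecture and random weights.

    Hypothetical helper: from_config() only instantiates the architecture,
    it does not download or load any pretrained weights.
    """
    return AutoModelForCausalLM.from_config(config)


# Tiny GPT-2 style config with illustrative sizes (assumption, not the
# real mGPT hyperparameters), used so the sketch needs no network access.
tiny_config = GPT2Config(
    vocab_size=1000,
    n_positions=64,
    n_embd=32,
    n_layer=2,
    n_head=2,
)

model = init_from_scratch(tiny_config)

# Count parameters of the freshly initialised model.
n_params = sum(p.numel() for p in model.parameters())
print(type(model).__name__, n_params)
```

A model created this way has no knowledge of any language until it is trained, which is the starting point the description above refers to.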