GPT2-Lithuanian / README.md
domce20's picture
Update README.md
e6a4731 verified
metadata
library_name: transformers
license: mit
datasets:
  - domce20/c4-lithuanian-enhanced
  - allenai/c4
language:
  - lt

Model Card for Model ID

GPT-2 based model trained for Lithuanian.

Model Description

The model architecture is copied from the ai-forever/mGPT model, however it is trained from scratch on a modified partition of the Lithuanian partition of the mC4 dataset.

The training was done on Vilnius University supercomputer.