Commit
·
30744db
1
Parent(s):
2e58ed8
Update README.md
Browse files
README.md
CHANGED
@@ -106,7 +106,7 @@ Full config can be found inside the .nemo files.
|
|
106 |
|
107 |
### Datasets
|
108 |
|
109 |
-
All the models were trained on
|
110 |
|
111 |
- Train set: ~250 hours.
|
112 |
- Dev set: ~25 hours.
|
|
|
106 |
|
107 |
### Datasets
|
108 |
|
109 |
+
All the models were trained on Mozilla Common Voice Esperanto 11.0 dataset comprising of about 1400 validated hours of Esperanto speech. However, training set consists of a much smaller amount of data, because when forming the train.tsv, dev.tsv and test.tsv, repetitions of texts in train were removed by Mozilla developers.
|
110 |
|
111 |
- Train set: ~250 hours.
|
112 |
- Dev set: ~25 hours.
|