Update README.md
README.md
CHANGED
@@ -49,23 +49,22 @@ PROPICTO ANR-20-CE93-0005
 
 ## Training Details
 
-
 ### Training Data
 
-
+We use the train / valid / test splits provided by CommonVoice, which correspond to:
+| | Train | Valid | Test |
+|:-------------:|:-------------:|:--------------:|:--------------:|
+| # utterances | 527,554 | 16,132 | 16,132 |
+| # hours | 756.19 | 25.84 | 26.11 |
 
 ### Training Procedure
 
-
-
-#### Preprocessing [optional]
-
-[More Information Needed]
-
+We follow the training procedure provided in the [ASR-CTC SpeechBrain recipe](https://github.com/speechbrain/speechbrain/tree/develop/recipes/CommonVoice/ASR/CTC).
+The `common_voice_prepare.py` script handles the preprocessing of the dataset.
 
 #### Training Hyperparameters
 
-
+
 
 #### Speeds, Sizes, Times [optional]
 
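
As a quick sanity check on the split table added in this diff, the sketch below derives the mean utterance duration and each split's share of the total audio from the reported counts and hours. The numbers are copied from the table; the names `SPLITS`, `mean_duration_seconds`, and `hour_share` are illustrative only and are not part of the SpeechBrain recipe or of `common_voice_prepare.py`.

```python
# Split statistics copied from the "Training Data" table in the diff.
SPLITS = {
    "train": {"utterances": 527_554, "hours": 756.19},
    "valid": {"utterances": 16_132, "hours": 25.84},
    "test": {"utterances": 16_132, "hours": 26.11},
}


def mean_duration_seconds(utterances: int, hours: float) -> float:
    """Average length of one utterance, in seconds."""
    return hours * 3600.0 / utterances


def hour_share(split: str) -> float:
    """Fraction of the total audio hours that falls in the given split."""
    total_hours = sum(stats["hours"] for stats in SPLITS.values())
    return SPLITS[split]["hours"] / total_hours


if __name__ == "__main__":
    for name, stats in SPLITS.items():
        duration = mean_duration_seconds(stats["utterances"], stats["hours"])
        print(f"{name}: {duration:.2f} s per utterance, "
              f"{hour_share(name):.1%} of all audio")
```

Under these figures, utterances average roughly five to six seconds in every split, which is consistent with short read-speech prompts.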