Propicto
/

asr-wav2vec2-commonvoice-15-fr

Automatic Speech Recognition

Model card Files Files and versions Community

cecilemacaire commited on Jan 15

Commit

44b2307

·

verified ·

1 Parent(s): a5e5574

Update README.md

Files changed (1) hide show

README.md +8 -9

README.md CHANGED Viewed

@@ -49,23 +49,22 @@ PROPICTO ANR-20-CE93-0005
 ## Training Details
 ### Training Data
 ### Training Procedure
-#### Preprocessing [optional]
-[More Information Needed]
 #### Training Hyperparameters
-- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
 #### Speeds, Sizes, Times [optional]

 ## Training Details
 ### Training Data
+We use the train / valid / test splits provided by CommonVoice, which corresponds to:
+| | Train | Valid | Test |
+|:-------------:|:-------------:|:--------------:|:--------------:|
+| # utterances | 527,554 | 16,132 | 16,132 |
+| # hours | 756.19 | 25.84 | 26.11 |
 ### Training Procedure
+We follow the training procedure provided in the (ASR-CTC speechbrain recipe)[https://github.com/speechbrain/speechbrain/tree/develop/recipes/CommonVoice/ASR/CTC].
+The `common_voice_prepare.py` script handles the preprocessing of the dataset.
 #### Training Hyperparameters
 #### Speeds, Sizes, Times [optional]