cecilemacaire commited on
Commit
44b2307
·
verified ·
1 Parent(s): a5e5574

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -9
README.md CHANGED
@@ -49,23 +49,22 @@ PROPICTO ANR-20-CE93-0005
49
 
50
  ## Training Details
51
 
52
-
53
  ### Training Data
54
 
55
-
 
 
 
 
56
 
57
  ### Training Procedure
58
 
59
-
60
-
61
- #### Preprocessing [optional]
62
-
63
- [More Information Needed]
64
-
65
 
66
  #### Training Hyperparameters
67
 
68
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
69
 
70
  #### Speeds, Sizes, Times [optional]
71
 
 
49
 
50
  ## Training Details
51
 
 
52
  ### Training Data
53
 
54
+ We use the train / valid / test splits provided by CommonVoice, which corresponds to:
55
+ | | Train | Valid | Test |
56
+ |:-------------:|:-------------:|:--------------:|:--------------:|
57
+ | # utterances | 527,554 | 16,132 | 16,132 |
58
+ | # hours | 756.19 | 25.84 | 26.11 |
59
 
60
  ### Training Procedure
61
 
62
+ We follow the training procedure provided in the (ASR-CTC speechbrain recipe)[https://github.com/speechbrain/speechbrain/tree/develop/recipes/CommonVoice/ASR/CTC].
63
+ The `common_voice_prepare.py` script handles the preprocessing of the dataset.
 
 
 
 
64
 
65
  #### Training Hyperparameters
66
 
67
+
68
 
69
  #### Speeds, Sizes, Times [optional]
70