octava committed
Commit 69ebad1 · verified · 1 Parent(s): 2472314

End of training

Files changed (1): README.md (+17 -17)
README.md CHANGED
@@ -7,7 +7,7 @@ base_model: openai/whisper-small
  tags:
  - generated_from_trainer
  datasets:
- - octava/indonesian-voice-transcription-1.4.9a-cv-fl
+ - octava/indonesian-voice-transcription-1.4.9a-cv-fl-slrjv-md
  metrics:
  - wer
  model-index:
@@ -17,13 +17,13 @@ model-index:
  name: Automatic Speech Recognition
  type: automatic-speech-recognition
  dataset:
- name: Extracted Youtube with CommonVoice11 and Fleurs
- type: octava/indonesian-voice-transcription-1.4.9a-cv-fl
+ name: Extracted Youtube with CommonVoice11, Fleurs, OpenSLR, and MagicData
+ type: octava/indonesian-voice-transcription-1.4.9a-cv-fl-slrjv-md
  args: 'config: id, split: train'
  metrics:
  - name: Wer
  type: wer
- value: 19.8005698005698
+ value: 19.96201329534663
  ---
  
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,10 +31,10 @@ should probably proofread and complete it, then remove this comment. -->
  
  # Optimized Whisper Small Id for Inspirasi
  
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Extracted Youtube with CommonVoice11 and Fleurs dataset.
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Extracted Youtube with CommonVoice11, Fleurs, OpenSLR, and MagicData dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.3393
- - Wer: 19.8006
+ - Loss: 0.3376
+ - Wer: 19.9620
  
  ## Model description
  
@@ -67,16 +67,16 @@ The following hyperparameters were used during training:
  
  | Training Loss | Epoch | Step | Validation Loss | Wer |
  |:-------------:|:------:|:----:|:---------------:|:-------:|
- | 0.3919 | 0.1865 | 500 | 0.3850 | 23.0864 |
- | 0.2481 | 0.3730 | 1000 | 0.3674 | 22.3077 |
- | 0.2087 | 0.5595 | 1500 | 0.3528 | 21.4530 |
- | 0.1567 | 0.7460 | 2000 | 0.3456 | 22.1368 |
- | 0.1462 | 0.9325 | 2500 | 0.3452 | 20.5603 |
- | 0.0676 | 1.1190 | 3000 | 0.3451 | 19.8481 |
- | 0.0667 | 1.3055 | 3500 | 0.3433 | 20.0190 |
- | 0.0585 | 1.4920 | 4000 | 0.3427 | 20.0570 |
- | 0.053 | 1.6785 | 4500 | 0.3407 | 20.0760 |
- | 0.0662 | 1.8650 | 5000 | 0.3393 | 19.8006 |
+ | 0.4122 | 0.1686 | 500 | 0.3999 | 24.8908 |
+ | 0.2737 | 0.3373 | 1000 | 0.3655 | 22.4691 |
+ | 0.2311 | 0.5059 | 1500 | 0.3491 | 21.5195 |
+ | 0.1947 | 0.6745 | 2000 | 0.3339 | 21.5100 |
+ | 0.169 | 0.8432 | 2500 | 0.3408 | 20.6363 |
+ | 0.0875 | 1.0118 | 3000 | 0.3429 | 21.2726 |
+ | 0.0877 | 1.1804 | 3500 | 0.3430 | 20.4748 |
+ | 0.0726 | 1.3491 | 4000 | 0.3396 | 20.2469 |
+ | 0.0741 | 1.5177 | 4500 | 0.3378 | 20.2754 |
+ | 0.0675 | 1.6863 | 5000 | 0.3376 | 19.9620 |
  
  
  ### Framework versions
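
The `Wer` values in this card look like percentages (19.9620 rather than 0.1996), though the card does not say how they were computed. As a minimal sketch of the textbook definition, assuming whitespace tokenization and no text normalization, word error rate is the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the sample sentences are illustrative and are not taken from the training code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage: 100 * word-level edit distance / reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of four reference words gives 25.0.
    print(word_error_rate("saya pergi ke pasar", "saya pergi pasar"))
```

Because the denominator is the reference length, insertions can push the value above 100, and casing or punctuation differences count as errors unless both strings are normalized first.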