File size: 1,590 Bytes
a3c0c78 883d070 a3c0c78 84403d1 883d070 a3c0c78 2df817d 883d070 bab6668 883d070 44253e2 bab6668 44253e2 a3c0c78 f4b5de0 38b25a9 29ae0cf 38b25a9 883d070 6d16672 55b6dc0 883d070 f025181 241f3a1 9e2e8f1 f73f733 1bcb20e 8b769d4 44253e2 8b769d4 89b421a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 |
---
base_model: facebook/wav2vec2-xls-r-300m
language:
- uk
license: "apache-2.0"
tags:
- automatic-speech-recognition
datasets:
- mozilla-foundation/common_voice_10_0
metrics:
- wer
model-index:
- name: w2v-xls-r-uk
results:
- task:
name: Automatic Speech Recognition
type: automatic-speech-recognition
dataset:
name: common_voice_10_0
type: common_voice_10_0
config: uk
split: test
args: uk
metrics:
- name: WER
type: wer
value: 20.24
- name: CER
type: cer
value: 3.64
---
π¨π¨π¨ **ATTENTION!** π¨π¨π¨
**Use an updated model**: https://huggingface.co/Yehor/w2v-bert-uk-v2.1
---
## Community
- Discord: https://bit.ly/discord-uds
- Speech Recognition: https://t.me/speech_recognition_uk
- Speech Synthesis: https://t.me/speech_synthesis_uk
See other Ukrainian models: https://github.com/egorsmkv/speech-recognition-uk
## Evaluation results
Metrics (float16) using `evaluate` library with `batch_size=1`:
- WER: 0.2024 metric, 20.24%
- CER: 0.0364 metric, 3.64%
- Accuracy on words: 79.76%
- Accuracy on chars: 96.36%
- Inference time: 63.4848 seconds
- Audio duration: 16665.5212 seconds
- RTF: 0.0038
## Cite this work
```
@misc {smoliakov_2025,
author = { {Smoliakov} },
title = { w2v-xls-r-uk (Revision 55b6dc0) },
year = 2025,
url = { https://huggingface.co/Yehor/w2v-xls-r-uk },
doi = { 10.57967/hf/4556 },
publisher = { Hugging Face }
}
```
|