license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_11_0
language:
- fr
- es
- pt
- da
- de
- nl
- fy
- zh
- ja
- ar
- sw
- gn
library_name: fairseq
HUTTER-12: H(uBERT) UTTER model covering 12 languages:
- Total training hours: 1,622
- Number of updates: 400k
- Number of iterations: 3
- Clustering approach: mini-batch K-means (100% of the data)
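
The clustering step named above can be illustrated with a minimal NumPy sketch of mini-batch K-means. This is not the pipeline used to train this model: the card does not state the feature type, the number of clusters, the batch size, or the script that was run, so the function names (`minibatch_kmeans`, `assign`), all parameter values, and the toy data below are assumptions chosen for illustration only.

```python
import numpy as np


def assign(features, centroids):
    """Return the index of the nearest centroid for each feature frame."""
    # Squared Euclidean distances, shape (n_frames, n_clusters).
    d2 = ((features[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
    return d2.argmin(axis=1)


def minibatch_kmeans(features, n_clusters, batch_size=256, n_steps=200, seed=0):
    """Fit centroids with mini-batch updates and per-centroid learning rates."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from randomly chosen, distinct frames.
    init_idx = rng.choice(len(features), size=n_clusters, replace=False)
    centroids = features[init_idx].astype(np.float64)
    counts = np.zeros(n_clusters)

    for _ in range(n_steps):
        # Sample a mini-batch (with replacement) and label it once per step.
        batch = features[rng.integers(0, len(features), size=batch_size)]
        labels = assign(batch, centroids)
        for x, k in zip(batch, labels):
            counts[k] += 1
            lr = 1.0 / counts[k]  # step size shrinks as a centroid sees more frames
            centroids[k] = (1.0 - lr) * centroids[k] + lr * x
    return centroids


# Toy stand-in for frame-level speech features (hypothetical data, 4 dimensions).
rng = np.random.default_rng(0)
toy_features = np.concatenate([
    rng.normal(loc=-5.0, scale=0.5, size=(500, 4)),
    rng.normal(loc=5.0, scale=0.5, size=(500, 4)),
])
centroids = minibatch_kmeans(toy_features, n_clusters=2)
units = assign(toy_features, centroids)  # one discrete unit id per frame
```

In a HuBERT-style setup the resulting unit ids serve as the discrete prediction targets for the next training iteration. Fitting on 100% of the data, as listed above, would correspond to drawing mini-batches from the full feature set rather than from a subsample.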