metadata
library_name: transformers
tags:
- prosody
- segmentation
- audio
- speech
language:
- sl
base_model:
- facebook/w2v-bert-2.0
Wav2Vec2Bert Audio frame classifier for prosodic unit detection
This model predicts prosodic units in speech.
For each 20 ms frame, the model outputs a two-element vector such as [0,1] or [1,0], indicating whether or not the
frame belongs to a prosodic unit.
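Frame-level predictions are usually easier to work with once they are merged into time intervals. The sketch below shows one way this could be done. The helper `frames_to_segments` is hypothetical and not part of this model or of transformers. It also assumes that index 1 of each vector marks "inside a prosodic unit"; the card does not state which index is the positive class, so check the model's label mapping and adjust `positive_index` accordingly.

```python
FRAME_SECONDS = 0.02  # 20 ms per frame, as stated in this model card


def frames_to_segments(frame_vectors, positive_index=1, frame_seconds=FRAME_SECONDS):
    """Merge runs of consecutive positive frames into (start_s, end_s) tuples.

    frame_vectors: list of two-element vectors, one per frame.
    positive_index: ASSUMPTION, the vector index that means
        "this frame is inside a prosodic unit".
    """
    segments = []
    start = None  # index of the first frame of the currently open run, if any
    for i, vec in enumerate(frame_vectors):
        # Pick the index of the largest value (argmax) as the frame label.
        label = max(range(len(vec)), key=lambda k: vec[k])
        if label == positive_index and start is None:
            start = i  # a new run begins at this frame
        elif label != positive_index and start is not None:
            # The run ended at the previous frame; close the interval here.
            segments.append((round(start * frame_seconds, 3),
                             round(i * frame_seconds, 3)))
            start = None
    if start is not None:
        # The last run extends to the end of the input.
        segments.append((round(start * frame_seconds, 3),
                         round(len(frame_vectors) * frame_seconds, 3)))
    return segments


example_frames = [[1, 0], [0, 1], [0, 1], [1, 0], [0, 1]]
example_segments = frames_to_segments(example_frames)
print(example_segments)
```

Interval boundaries are reported in seconds from the start of the audio that was passed to the model. If the audio was chunked before inference, the chunk offset would have to be added separately.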
Model Details
Model Description
- Developed by: Peter Rupnik, Nikola Ljubešić
- Funded by: MEZZANINE project
- Model type: Wav2Vec2Bert for Audio Frame Classification
- Language(s) (NLP): Trained and tested on Slovenian; it is currently unclear whether the model is usable cross-lingually
- Finetuned from model: facebook/w2v-bert-2.0