wav2vecbert2-prosodicUnit / README.md

5roop

Update README.md

f1462b3 verified 10 months ago

preview code

raw

history blame

1.01 kB

metadata

library_name: transformers
tags:
  - prosody
  - segmentation
  - audio
  - speech
language:
  - sl
base_model:
  - facebook/w2v-bert-2.0

Wav2Vec2Bert Audio frame classifier for prosodic unit detection

This model predicts prosodic units on speech. For each 20ms frame the model predicts a vector like [0,1] or [1,0], indicating whether there is a prosodic unit in this frame or not.

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

Developed by: Peter Rupnik, Nikola Ljubešić
Funded by: MEZZANINE project
Model type: Wav2Vec2Bert for Audio Frame Classification
Language(s) (NLP): Trained and tested on Slovenian, ATM unclear if usable cross-lingually
Finetuned from model: facebook/w2v-bert-2.0

classla
/

wav2vecbert2-prosodicUnit

Wav2Vec2Bert Audio frame classifier for prosodic unit detection

Model Details

Model Description

Uses

Bias, Risks, and Limitations

Training Details

Evaluation