5roop's picture
Update README.md
f1462b3 verified
|
raw
history blame
1.01 kB
metadata
library_name: transformers
tags:
  - prosody
  - segmentation
  - audio
  - speech
language:
  - sl
base_model:
  - facebook/w2v-bert-2.0

Wav2Vec2Bert Audio frame classifier for prosodic unit detection

This model predicts prosodic units on speech. For each 20ms frame the model predicts a vector like [0,1] or [1,0], indicating whether there is a prosodic unit in this frame or not.

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: Peter Rupnik, Nikola Ljubešić
  • Funded by: MEZZANINE project
  • Model type: Wav2Vec2Bert for Audio Frame Classification
  • Language(s) (NLP): Trained and tested on Slovenian, ATM unclear if usable cross-lingually
  • Finetuned from model: facebook/w2v-bert-2.0

Uses

Bias, Risks, and Limitations

Training Details

Evaluation