metadata
license: mit
language:
- sr
metrics:
- wer
base_model:
- facebook/wav2vec2-large-xlsr-53
pipeline_tag: audio-classification
tags:
- voice-command
- smart-home
- voice-controller
- serbian-wav2vec
- audio-classification
Model Card for Model ID
Smart home controller simulator, receiving voice commands from a microphone. Trained to detect the words: "vrata", "svetlo", "zvuk", "otvori", "zatvori", "uključi" and "isključi" to control the state of door, lights and audio in a smart home system.
Model Details
Model Description
- Developed by: Mihailo Radović
- Model type: Audio Classification (Smart home controller)
- Language(s) (NLP): Serbian
- License: MIT
- Finetuned from model: facebook/wav2vec2-large-xlsr-53
Model Sources
- Repository: GitHub Repo
- Demo: Streamlit App
Uses
Direct Use
Detecting a word said in a short audio clip: "vrata", "svetlo", "zvuk", "otvori", "zatvori", "uključi" and "isključi" to control the state of door, lights and audio in a smart home system.
Out-of-Scope Use
Works the best for the words that are in the dataset. For the words that are out of the vocabulary, the DTW check is implemented (see code in GitHub Repo).
How to Get Started with the Model
Check out the explaination in my GitHub repository README file.