Model Card for wav2vec2-large-xlsr-53-serbian-smart-home-commands

A smart home controller simulator that receives voice commands from a microphone. The model is trained to detect the Serbian words "vrata" (door), "svetlo" (light), "zvuk" (sound), "otvori" (open), "zatvori" (close), "uključi" (turn on) and "isključi" (turn off), which control the state of the door, lights and audio in a smart home system.

Model Details

Model Description

  • Developed by: Mihailo Radović
  • Model type: Audio Classification (Smart home controller)
  • Language(s) (NLP): Serbian
  • License: MIT
  • Finetuned from model: facebook/wav2vec2-large-xlsr-53

Model Sources

Uses

Direct Use

Detecting which command word is spoken in a short audio clip. The supported words are "vrata", "svetlo", "zvuk", "otvori", "zatvori", "uključi" and "isključi"; the detected word is used to control the state of the door, lights and audio in a smart home system.
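
To illustrate how detected words could drive the state of the door, lights and audio, here is a minimal sketch in Python. It is not the code from the GitHub repository. The `SmartHome` class, the English state names, the assumption that an action word is paired with a target word, and the table of which actions are valid for which target are all hypothetical choices made for this example.

```python
from dataclasses import dataclass

# Serbian target words from the model card, mapped to hypothetical attribute names.
TARGETS = {"vrata": "door", "svetlo": "light", "zvuk": "audio"}

# Each action word maps to the boolean state it requests.
ACTIONS = {"otvori": True, "zatvori": False, "uključi": True, "isključi": False}

# Which action words are accepted for which target (an assumption for this sketch).
VALID = {
    "door": {"otvori", "zatvori"},
    "light": {"uključi", "isključi"},
    "audio": {"uključi", "isključi"},
}


@dataclass
class SmartHome:
    door: bool = False   # True means open
    light: bool = False  # True means on
    audio: bool = False  # True means on

    def apply(self, action_word: str, target_word: str) -> bool:
        """Apply one (action, target) word pair; return True if it was accepted."""
        target = TARGETS.get(target_word)
        if target is None or action_word not in VALID[target]:
            return False  # unknown target or an action that does not fit it
        setattr(self, target, ACTIONS[action_word])
        return True
```

For example, `SmartHome().apply("otvori", "vrata")` is accepted and sets `door` to `True`, while `apply("uključi", "vrata")` is rejected and leaves the state unchanged.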

Out-of-Scope Use

The model works best on the words present in the training dataset. For out-of-vocabulary words, a DTW (dynamic time warping) check is implemented (see the code in the GitHub repo).
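
For readers unfamiliar with DTW, the sketch below shows the general idea of using a dynamic time warping distance to reject out-of-vocabulary clips. It is a generic textbook formulation and may differ from the check in the GitHub repo. The function names, the use of per-frame feature vectors, the reference templates and the threshold value are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of equal-dimension feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the cheapest alignment of the first i frames of a with the first j of b.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean distance between frames
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a aligned with an already-used frame of b
                cost[i][j - 1],      # frame of b aligned with an already-used frame of a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def is_in_vocabulary(features, templates, threshold):
    """Hypothetical check: accept a clip if its closest template is within the threshold."""
    best = min(dtw_distance(features, t) for t in templates)
    return best <= threshold
```

A clip that is a time-stretched copy of a template has distance 0 and is accepted, whereas a clip whose frames are far from every template exceeds the threshold and can be treated as out of vocabulary. A suitable threshold would have to be tuned on real data.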

How to Get Started with the Model

See the explanation in the README file of my GitHub repository.

Model size: 315M params (Safetensors, tensor type F32)
Model tree: mradovic38/wav2vec2-large-xlsr-53-serbian-smart-home-commands (this model) is fine-tuned from facebook/wav2vec2-large-xlsr-53.