Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
soham97
/
mellow
like
2
small audio-language model
ALM
audio
music
sound events
audio reasoning
audio captioning
audio question answering
zero-shot
audio-text
arxiv:
2503.08540
License:
mit
Model card
Files
Files and versions
Community
2c6e7ae
mellow
/
resource
1 contributor
History:
1 commit
soham97
first
2c6e7ae
20 days ago
data.png
492 kB
LFS
first
20 days ago
image.png
4.92 MB
LFS
first
20 days ago