first
README.md CHANGED
@@ -13,7 +13,7 @@ tags:
   - audio-text
 ---
 # Mellow
-[[`Paper`]()] [[`GitHub`](https://github.com/soham97/Mellow)] [[
+[[`Paper`]()] [[`GitHub`](https://github.com/soham97/Mellow)] [[`🤗Checkpoint`](https://huggingface.co/soham97/Mellow)] [[`Zenodo`](https://huggingface.co/soham97/Mellow)]
 
 Mellow is a small Audio-Language Model that takes in two audios and a text prompt as input and produces free-form text as output. It is a 167M parameter model and trained on ~155 hours of audio (AudioCaps and Clotho), and achieves SoTA performance on different tasks with 50x fewer parameters.
 
@@ -43,8 +43,8 @@ python example.py
 
 ## Usage
 The MellowWrapper class allows easy interaction with the model. To use the wrapper, inputs required are:
-- `config`: The option supported is "
-- `model`: The option supported is "v0
+- `config`: The option supported is "v0"
+- `model`: The option supported is "v0"
 - `examples`: List of examples. Each example is a list containing three entries: audiopath1, audiopath2, prompt
 
 Supported functions:
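For context, the Usage section touched by this diff describes a wrapper that takes `config="v0"`, `model="v0"`, and a list of `[audiopath1, audiopath2, prompt]` examples. Below is a minimal sketch of how that might look in practice; it is not the repository's `example.py`. The `from mellow import MellowWrapper` import path, the `device`/`use_cuda` constructor arguments, and the `generate(...)` call with its keyword arguments are assumptions — only the `config`/`model` values and the three-entry example format come from the README diff above.

```python
import torch
from mellow import MellowWrapper  # assumed import path; adjust to the repo layout

# Pick a device; Mellow is small (167M parameters), so CPU also works.
cuda = torch.cuda.is_available()
device = 0 if cuda else "cpu"

# The README states "v0" is the only supported option for both config and model.
# The device/use_cuda keywords are assumptions about the wrapper's constructor.
mellow = MellowWrapper(config="v0", model="v0", device=device, use_cuda=cuda)

# Each example is [audiopath1, audiopath2, prompt], as described in the Usage section.
examples = [
    ["resource/1.wav", "resource/2.wav", "what can you infer about the surroundings from the audio?"],
    ["resource/1.wav", "resource/2.wav", "caption the two audios"],
]

# A generate-style call is assumed here; check the repo's "Supported functions"
# list and example.py for the exact method names and signatures.
responses = mellow.generate(examples=examples, max_len=300, top_p=0.8, temperature=1.0)
for r in responses:
    print(r)
```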