Spaces:

tinywave
/

README

Configuration error

App Files Files Community

mohammadmahdinouri commited on Jul 1

Commit

de35df9

verified ·

1 Parent(s): aad5f00

Update README.md

Browse files

Files changed (1) hide show

README.md +18 -8

README.md CHANGED Viewed

@@ -1,10 +1,20 @@
 ---
-title: README
-emoji: 🐨
-colorFrom: purple
-colorTo: purple
-sdk: static
-pinned: false
----
-Edit this `README.md` markdown file to author your organization card.

+# 🌊 TinyWave: Compact & Expressive Speech Language Models
+**TinyWave** is a family of **efficient 2B-parameter speech language models** distilled from the 7B SPIRIT-LM teacher. These models support **speech-to-speech** and **interleaved speech–text generation**, optimized for real-time use on **commodity hardware**.
+Built through **layer-aligned knowledge distillation**, TinyWave models retain **93–97%** of their teacher’s performance while using only **⅓ of the parameters** — ideal for use in voice agents, assistive technologies, and edge devices.
+> 📖 Read the paper: [Efficient Interleaved Speech Modeling through Knowledge Distillation (arXiv:2506.23670)](https://arxiv.org/abs/2506.23670)
+> 🌐 Demo & samples: [tinywave-landing](https://mohammadmahdinoori.github.io/tinywave-landing/)
+> 💻 Code: [github.com/mohammadmahdinoori/TinyWave](https://github.com/mohammadmahdinoori/TinyWave)
 ---
+## 🔧 Model Variants
+| Model                                                        | Modality                  | Tokenizer             | Description                                 |
+|--------------------------------------------------------------|---------------------------|------------------------|---------------------------------------------|
+| [`tinywave/speech-base-2b`](https://huggingface.co/tinywave/speech-base-2b) | Speech → Speech           | `spiritlm_base`        | Base phonetic-only speech generation        |
+| [`tinywave/speech-expressive-2b`](https://huggingface.co/tinywave/speech-expressive-2b) | Speech → Expressive Speech | `spiritlm_expressive`   | Includes pitch + style tokens               |
+| [`tinywave/interleaved-expressive-2b`](https://huggingface.co/tinywave/interleaved-expressive-2b) | Text ↔ Speech (interleaved) | `spiritlm_expressive` | Multimodal expressive generation            |
+| [`tinywave/expressive-spirit-lm-interleaved-librilight`](https://huggingface.co/tinywave/expressive-spirit-lm-interleaved-librilight) | Teacher (7B, interleaved) | `spiritlm_expressive` | LoRA-corrected SPIRIT-LM for distillation  |