--- license: apache-2.0 base_model: - microsoft/wavlm-large pipeline_tag: audio-to-audio datasets: - mythicinfinity/libritts library_name: pytorch --- # ⚡ FocalCodec A low-bitrate single-codebook 16 kHz speech codec based on [focal modulation](https://arxiv.org/abs/2203.11926). This repository contains the **25 Hz checkpoint** trained on **LibriTTS 960**, as described in the preprint. - 📜 **Preprint**: https://arxiv.org/abs/2502.04465 - 🌐 **Project Page**: https://lucadellalib.github.io/focalcodec-web/ - 💾 **GitHub**: https://github.com/lucadellalib/focalcodec --------------------------------------------------------------------------------------------------------- ## ▶️ Quickstart See the readme at: https://github.com/lucadellalib/focalcodec --------------------------------------------------------------------------------------------------------- ## @ Citing ``` @article{dellalibera2025focalcodec, title = {{FocalCodec}: Low-Bitrate Speech Coding via Focal Modulation Networks}, author = {Luca {Della Libera} and Francesco Paissan and Cem Subakan and Mirco Ravanelli}, journal = {arXiv preprint arXiv:2502.04465}, year = {2025}, } ``` --------------------------------------------------------------------------------------------------------- ## 📧 Contact [luca.dellalib@gmail.com](mailto:luca.dellalib@gmail.com) ---------------------------------------------------------------------------------------------------------