Audio-to-Audio
PyTorch
Safetensors

⚡ FocalCodec

A low-bitrate single-codebook 16 kHz speech codec based on focal modulation.

This repository contains the 25 Hz checkpoint trained on LibriTTS 960, as described in the preprint.


▶️ Quickstart

See the readme at: https://github.com/lucadellalib/focalcodec


@ Citing

@article{dellalibera2025focalcodec,
    title   = {{FocalCodec}: Low-Bitrate Speech Coding via Focal Modulation Networks},
    author  = {Luca {Della Libera} and Francesco Paissan and Cem Subakan and Mirco Ravanelli},
    journal = {arXiv preprint arXiv:2502.04465},
    year    = {2025},
}

📧 Contact

[email protected]


Downloads last month
149
Safetensors
Model size
144M params
Tensor type
I64
·
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support audio-to-audio models for pytorch library.

Model tree for lucadellalib/focalcodec_25hz

Finetuned
(7)
this model

Dataset used to train lucadellalib/focalcodec_25hz

Collection including lucadellalib/focalcodec_25hz