metadata

license: apache-2.0
pipeline_tag: text-to-speech
tags:
  - model_hub_mixin
  - pytorch_model_hub_mixin

This model has been pushed to the Hub using the PytorchModelHubMixin integration:

Library: https://github.com/SesameAILabs/csm

Installation

First, clone the add_hf branch of the NielsRogge/csm fork and install its requirements:

git clone -b add_hf https://github.com/NielsRogge/csm.git
cd csm
pip install -r requirements.txt

Usage

import torchaudio
from generator import load_csm_1b

generator = load_csm_1b(device="cuda")

audio = generator.generate(
    text="Hello from Sesame.",
    speaker=0,
    context=[],
    max_audio_length_ms=10_000,
)

torchaudio.save("audio.wav", audio.unsqueeze(0).cpu(), generator.sample_rate)
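To synthesize several utterances with one loaded generator, the call above can be wrapped in a small loop. The following is a minimal sketch, not part of the csm repository: the helper name `generate_many` and the `audio_000.wav` file-naming scheme are hypothetical, and it assumes only that `generator.generate` accepts the same keyword arguments shown in the usage example.

```python
def generate_many(generator, texts, speaker=0, max_audio_length_ms=10_000):
    """Call generator.generate once per text and return (filename, audio) pairs.

    `generate_many` is a hypothetical helper, not part of the csm repository.
    It relies only on the generate() keyword arguments shown in the usage above.
    """
    results = []
    for index, text in enumerate(texts):
        audio = generator.generate(
            text=text,
            speaker=speaker,
            context=[],  # no conversational context, as in the usage example
            max_audio_length_ms=max_audio_length_ms,
        )
        # Hypothetical naming scheme: audio_000.wav, audio_001.wav, ...
        results.append((f"audio_{index:03d}.wav", audio))
    return results
```

Each returned pair can then be written to disk the same way as in the usage example, for instance `torchaudio.save(name, audio.unsqueeze(0).cpu(), generator.sample_rate)` for every `(name, audio)` in the result. Loading the model once and reusing it avoids repeating the load for every utterance.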