Audio Diffusion

Audio Diffusion, by Robert Dargavel Smith, applies recent advances in diffusion-based image generation to audio by converting audio samples to and from Mel spectrogram images.
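To make the audio-to-image step concrete, here is a minimal NumPy-only sketch of turning a mono waveform into a grayscale Mel spectrogram array. It is not the project's own implementation: the helper names (`hz_to_mel`, `mel_to_hz`, `mel_filterbank`, `audio_to_mel_image`), the HTK-style mel formula, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def hz_to_mel(hz):
    # HTK-style mel scale (an assumption; other conventions exist).
    return 2595.0 * np.log10(1.0 + np.asarray(hz, dtype=float) / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(sample_rate, n_fft, n_mels):
    """Triangular mel filters with shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    # n_mels + 2 edge frequencies, evenly spaced on the mel scale.
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_mels + 2))
    bank = np.zeros((n_mels, fft_freqs.size))
    for i in range(n_mels):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def audio_to_mel_image(audio, sample_rate=22050, n_fft=2048,
                       hop_length=512, n_mels=256, top_db=80.0):
    """Map a mono waveform to a uint8 array of shape (n_mels, n_frames).

    Row 0 holds the lowest mel band; flip vertically for the usual
    "low frequencies at the bottom" picture. The waveform must contain
    at least n_fft samples.
    """
    audio = np.asarray(audio, dtype=float)
    n_frames = 1 + (audio.size - n_fft) // hop_length
    window = np.hanning(n_fft)
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([
        audio[i * hop_length: i * hop_length + n_fft] * window
        for i in range(n_frames)
    ])
    # Power spectrum per frame, then projection onto the mel bands.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel_power = mel_filterbank(sample_rate, n_fft, n_mels) @ power.T
    # Decibels relative to the loudest cell, limited to a top_db range.
    log_mel = 10.0 * np.log10(np.maximum(mel_power, 1e-10))
    log_mel = np.maximum(log_mel - log_mel.max(), -top_db)
    # Rescale [-top_db, 0] dB to [0, 255] pixel intensities.
    pixels = (log_mel + top_db) / top_db * 255.0
    return np.round(pixels).astype(np.uint8)


if __name__ == "__main__":
    sr = 22050
    t = np.arange(sr) / sr
    tone = np.sin(2 * np.pi * 440.0 * t)  # one second of a 440 Hz sine
    image = audio_to_mel_image(tone, sample_rate=sr)
    print(image.shape, image.dtype)
```

The resulting array can be saved or treated like any other single-channel image. Going back from image to audio is harder because the spectrogram keeps magnitudes only, so the phase has to be estimated; this sketch leaves that direction out.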

The original codebase, training scripts and example notebooks can be found at teticio/audio-diffusion.

Make sure to check out the Schedulers guide to learn how to explore the tradeoff between scheduler speed and quality, and see the reuse components across pipelines section to learn how to efficiently load the same components into multiple pipelines.

AudioDiffusionPipeline

[[autodoc]] AudioDiffusionPipeline
	- all
	- __call__

AudioPipelineOutput

[[autodoc]] pipelines.AudioPipelineOutput

ImagePipelineOutput

[[autodoc]] pipelines.ImagePipelineOutput

Mel

[[autodoc]] Mel