enlyth / sd-webui-riffusion

Riffusion extension for AUTOMATIC1111's SD Web UI
MIT License
195 stars 23 forks source link

Riffusion extension for AUTOMATIC1111 Web UI

Screenshot

Screenshot

Installation

Usage

Select the Riffusion Audio Generator script before generating, and use the riffusion model.

You can also convert a whole folder of images to audio in the Riffusion tab.

Prompt Travelling

If you want to prompt travel in the latent space as described by the authors, install this extension:

https://github.com/Kahsolt/stable-diffusion-webui-prompt-travel

It will output the results of runs in the <SD>/outputs/(txt|img)2img-images/prompt_travel/ directory, and you can use the convert folder to audio functionality in the Riffusion tab to generate a single stitched-together audio file alongside the individual ones.

Here is a sample made by travelling in img2img mode from jamaican rap to deep house, techno with denoise 0.5 for 14 steps, and using the og_beat.png provided by the original authors as a base image:

Audio Sample (Jamaican Rap to Deep House, Techno)

Acknowledgements

Credit to the original Riffusion authors, Seth Forsgren and Hayk Martiros:

https://riffusion.com/about