Zeqiang-Lai / Anything2Image

Generate image from anything with ImageBind and Stable Diffusion
190 stars 23 forks source link

I want to generate audio from image or text, which model should I use? Thanks #1

Closed WilTay1 closed 1 year ago

Zeqiang-Lai commented 1 year ago

I am sorry that this repo currently only contains models for generate image from audio, or other modality data.

For text to audio, you could use https://huggingface.co/docs/diffusers/api/pipelines/audio_diffusion