haoheliu / AudioLDM2

Text-to-Audio/Music Generation
Other
2.31k stars 180 forks source link

How to generate speech condition on not only transcripts and descriptions but also audio clip? #54

Open bigpon opened 1 year ago

bigpon commented 1 year ago

Hi, Thanks for the open-source code. I want to generate speech conditioned on transcripts, descriptions, and audio clips by using the audioldm-gigaspech pre-trained model. However, I found the provided example only accepts transcripts and descriptions. Can you also release the example using not only transcripts and descriptions but also audio clips? or do you have some tips to modify the code to run the speech generation based on transcripts, descriptions, and audio clips?

Thanks in advance.