pjlab-songcomposer / songcomposer

133 stars 6 forks source link

How to generate audio #3

Open kily-wmz opened 6 months ago

kily-wmz commented 6 months ago

Hi!

I have successfully run the code for the SFT model, but I was wondering how to convert the output into audio. Is this part of the code publicly available? Thanks!

Mark12Ding commented 6 months ago

Hi,

Actually, we provide the code for parsing the generated string into the midi file at utils.py and an example in the inference.ipynb. You can check it out.

For the audio part, we use the commercial app ACE to convert the midi file into audio. Other vocal synthesizer apps are also capable of doing that such as synthesizerv.

Best, Shuangrui

kily-wmz commented 6 months ago

Hi! I'm glad you could reply! I've successfully converted the output of the SFT model into a MIDI file and played it. But currently, it generates only pure audio. How can I generate an audio file that plays both the melody and the lyrics together? Is there any code available for this part? Looking forward to your reply. Thanks!