maum-ai / assem-vc

Official Code for Assem-VC @ICASSP2022
https://mindslab-ai.github.io/assem-vc/
BSD 3-Clause "New" or "Revised" License

Pre-trained model #17

Closed · jJohnny342 closed this issue 3 years ago

jJohnny342 commented 3 years ago

Could you please tell us when you are planning to release the pre-trained model? Also, could you provide some kind of loss graph, or just the number of training steps each module needs to converge on LibriTTS + VCTK, so we can estimate whether it is feasible for mere mortals to train the model without multiple high-end GPUs? Finally, could you elaborate on the audio normalization mentioned in your paper? Is it implemented somewhere in your project, or should we process the audio files by some other means? Thank you!

wookladin commented 3 years ago

Hi! We just released the pre-trained weights. Please check them out!

Unfortunately, our Cotatron loss graph has been deleted, so we cannot upload it. However, we can provide the loss graph of the synthesizer (VC decoder); please see below.

[image: loss graph of the synthesizer (VC decoder)]

FYI, we trained Cotatron for 25k steps on LibriTTS only, and then for 20k more steps on LibriTTS and VCTK. Cotatron's validation reconstruction loss was about 0.28.

Lastly, audio normalization is implemented in our project. You can enable it by setting the norm option in https://github.com/mindslab-ai/assem-vc/blob/master/datasets/text_mel_dataset.py#L17 to True. To use audio normalization during training, make sure the norm option in cotatron.py and synthesizer.py is also set to True: https://github.com/mindslab-ai/assem-vc/blob/master/cotatron.py#L136
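For reference, a norm flag like this commonly means simple peak normalization of the waveform at load time. Below is a minimal sketch of that idea; the function `load_and_normalize` and its exact behavior are assumptions for illustration, not the repository's actual code (see `datasets/text_mel_dataset.py` for the real logic):

```python
import numpy as np
import librosa

def load_and_normalize(path: str, sr: int = 22050, norm: bool = True) -> np.ndarray:
    """Hypothetical example of loading a waveform with an optional `norm` flag.

    This is a stand-in for what such an option typically does, not the
    implementation used in text_mel_dataset.py.
    """
    wav, _ = librosa.load(path, sr=sr)
    if norm:
        peak = np.abs(wav).max()
        if peak > 0:
            # Scale so the loudest sample has magnitude 1.0.
            wav = wav / peak
    return wav
```

Whichever setting you choose, the key point is consistency: the dataset, cotatron.py, and synthesizer.py should all use the same norm value so that training and inference see identically scaled audio.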

Since we observed that normalization does not affect the output, we set the norm option to False in this implementation. Thanks!