about Loss_T - Githubissues

iamycy / diffwave-sr

MIT License

79 stars 8 forks source link

Hi @FlyToYourMooN, thanks for asking.

Could you provide some generated audio samples?

The loss_T looks normal, but loss (the negative ELBO) looks higher than usual (in my experiments the loss should be around -5.6 at 240k).

You can also check out the repo https://github.com/yoyololicon/duet-svs-diffusion. We used the 1D UNet from https://github.com/archinetai/audio-diffusion-pytorch as a denoiser (which is stronger than the noncausal wavenet of diffwave) and trained it on 8 singing voice datasets (including OpenSinger). We also made the checkpoint available.

I hope this helps.

iamycy / diffwave-sr