Open latentcode opened 4 years ago
I got similar issues when I first ran it, now I can generate audio correctly and please try:
Then I "docker commit" to a new image for my future use.
torch.nn.Module.dump_patches = True
and use the patch tool to revert the changes.
warnings.warn(msg, SourceChangeWarning)So I added: torch.nn.Module.dump_patches = True to inference.py
also for WaveGlow checkpoint, I tried your first one too but it only generated silent audio, I found this can work: https://ngc.nvidia.com/catalog/models/nvidia:waveglow_ljs_256channels
Now I can generate good audio.
I was able to get it working on Ubuntu without NGC by using Waveglow V3.
Then I tried the exact same thing on Windows and I can't get it to work. It's just silence every time.
your issue can be related to the pytorch version. make sure you're running the latest version and try inference in fp32.
I tried running the "Inference demo". The spectrograms seem reasonable (attached), but the wav file that was generated (sid0_sigma0.5.wav, 367 KB) has no sound. What follows is a list of all the issues I had, offered in the context of improvement for all, not to be construed as lack of appreciation for the work that was done so far! Thank you BTW.
1) I used this Docker image: nvcr.io/nvidia/pytorch:20.09-py3, but it's missing tensorboardX. NOTE: I've also run waveglow, but the TOT version of that repo doesn't work in this Docker. It leads to this error:
I had success running waveglow with: nvcr.io/nvidia/pytorch:20.03-py3
2) The instructions indicate:
but waveglow_256channels_v4.pt is not provided. I found a few work arounds. I tried models from:
I didn't get a valid .wav file with any of these.
I updated inference.py: