seungwonpark / melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)
http://swpark.me/melgan/
BSD 3-Clause "New" or "Revised" License
633 stars, 116 forks

Strange inference results with pretrained 6400epoch model #48

Open ghost opened 4 years ago

ghost commented 4 years ago

Thank you for your implementation and effort. I have a question about inference and about reproducing the test samples from your pretrained model. Preprocessing, training, and inference all run without problems on my Ubuntu machine, but the results are strange: I cannot reproduce your samples with the model trained for 6400 epochs.

Am I missing something crucial? Can this model generate audio unconditionally? What mel input is expected for inference? Can your implementation perform audio translation?

My generated test samples and config files are in this folder: https://drive.google.com/drive/folders/1zRhTFP7GepXrm_DPHkF1Nt94LZXMBX4Z?usp=sharing