I have been facing a weird issue while testing. I successfully trained the SEGAN model for 19440 iterations for a batch size of 100. During training at the save_freq the max and min values of the generated sample audios are printed. Here, almost all the audio files vary from +0.55.... to -0.5....
Now, during testing for the same audio file in the training set for the same weights, the output behave like this:
The generated wav sounds even noisier than before and the speech segments sound extremely loud and distorted. I have no idea why this would be happening? Need some help please.
I solved the issue! I had added virtual batch normalization (VBN) for the generator as well... I removed VBN from the last layer of the generator and it worked well.
I have been facing a weird issue while testing. I successfully trained the SEGAN model for 19440 iterations for a batch size of 100. During training at the
save_freq
the max and min values of the generated sample audios are printed. Here, almost all the audio files vary from +0.55.... to -0.5....Now, during testing for the same audio file in the training set for the same weights, the output behave like this:
The generated wav sounds even noisier than before and the speech segments sound extremely loud and distorted. I have no idea why this would be happening? Need some help please.
Originally posted by @HusainKapadia in https://github.com/santi-pdp/segan/issues/38#issuecomment-432590310