Something looks strange in the MOS table presented in the paper. The WaveGlow score is 4.57 ± 0.04, only 0.05 lower than GroundTruth. Does anyone understand why?
The samples shared for SqueezeWave are also of lower quality than expected, sounding a bit robotic even with the 128L model. The WaveGlow results shared by NVIDIA sound much better to me.