APMplusplus / falkon

Towards an ecosystem of tasks and concepts related to Language Technologies
Apache License 2.0
3 stars 3 forks source link

Resynthesis leads to corrupted output #16

Open saikrishnarallabandi opened 5 years ago

saikrishnarallabandi commented 5 years ago

It seems to get corrupted on extraction and resynthesis Music becomes noise

ADHBSU/aligned/wav/B01_01_Matthew___ADHBSUN2DA_00005.wav

saikrishnarallabandi commented 5 years ago

@saikrishnarallabandi

saikrishnarallabandi commented 5 years ago

@nishantgurunath feel free to add more details

saikrishnarallabandi commented 5 years ago

@nishantgurunath cud confirm the issue.

Resynthesized file here: http://tts.speech.cs.cmu.edu/rsk/misc_stuff/B01___01_Matthew_____ADHBSUN2DA_00005_reconstructed.wav

Looks like this is due to the spectral representation we are using. Let me look at other representations

saikrishnarallabandi commented 5 years ago

Extracting features at different frame shifts doesnt resolve the issue:

1 msec: http://tts.speech.cs.cmu.edu/rsk/misc_stuff/B01___01_Matthew_____ADHBSUN2DA_00005_reconstructed_1msec.wav

20 msec: http://tts.speech.cs.cmu.edu/rsk/misc_stuff/B01___01_Matthew_____ADHBSUN2DA_00005_reconstructed_20msec.wav