:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
I read the paper and have been going through the code here. The paper uses the Montreal Forced Aligner (MFA) to extract phoneme durations, but I could not find where this alignment is done or where the durations are extracted for training. The paper also mentions techniques such as the continuous wavelet transform (CWT) for pitch. Is that implemented anywhere? The preprocessing code here simply uses pyworld to extract the pitch.
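To make the question concrete, this is roughly the duration step I expected to find in preprocessing. It is only a minimal sketch, not code from this repository: the function name `intervals_to_durations`, the `(phoneme, start_sec, end_sec)` interval format, and the `sample_rate=22050` / `hop_size=256` values are all my own assumptions, and reading the aligner's actual output files is left out.

```python
def intervals_to_durations(intervals, sample_rate=22050, hop_size=256):
    """Convert (phoneme, start_sec, end_sec) intervals to per-phoneme frame counts.

    Hypothetical helper: the interval format and the default sample rate and
    hop size are assumptions for illustration, not values taken from the repo.
    """
    durations = []
    for phoneme, start, end in intervals:
        # Round each boundary to a frame index first, then take the difference.
        # For contiguous intervals this keeps the total equal to the frame
        # index of the last boundary, so rounding errors do not accumulate.
        start_frame = int(round(start * sample_rate / hop_size))
        end_frame = int(round(end * sample_rate / hop_size))
        durations.append((phoneme, end_frame - start_frame))
    return durations


# Made-up alignment for a single word, times in seconds.
example = [
    ("HH", 0.00, 0.08),
    ("AH", 0.08, 0.20),
    ("L", 0.20, 0.29),
    ("OW", 0.29, 0.50),
]
durations = intervals_to_durations(example)
```

The per-phoneme frame counts would then need to sum to the number of mel-spectrogram frames for the utterance, which is the part I could not locate in the preprocessing code.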