Closed Jiaxin-Ye closed 1 month ago
Hi! Thank you for your awesome work! I am a freshman on TTS, and I don't see any text-speech alignment method on this project. I wonder whether the T5 model can automatically upsample the semantics token?
Hi! Thank you for your awesome work! I am a freshman on TTS, and I don't see any text-speech alignment method on this project. I wonder whether the T5 model can automatically upsample the semantics token?