Hello there, devs of StyleTTS 2. It's a great model, and you really did a good job.
I mainly use it through the Hugging Face demo, and I have run into a couple of issues:
First, it pauses at a hyphen (-) inside a word. For example, it reads "white-clothed" as "White. Clothed". Could this be fixed?
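In the meantime, here is a rough workaround sketch that I would try on the input text before synthesis. It assumes the pause comes from how the hyphen is handled during text processing, which I have not confirmed. `join_hyphenated` is a hypothetical helper of my own and not part of StyleTTS 2.

```python
import re

# Hypothetical helper, not part of StyleTTS 2.
# Matches a hyphen that sits directly between two word characters,
# e.g. the one in "white-clothed", but not a spaced dash like "a - b".
_INTRAWORD_HYPHEN = re.compile(r"(?<=\w)-(?=\w)")


def join_hyphenated(text: str) -> str:
    """Replace intra-word hyphens with a space before passing text to the TTS."""
    return _INTRAWORD_HYPHEN.sub(" ", text)


if __name__ == "__main__":
    print(join_hyphenated("A white-clothed figure stood there."))
```

This only changes the text that goes in, so it would not address the underlying behavior in the model or its text frontend.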
Second, it sometimes produces random bursts of distorted noise and skips words. Is there a way to fix this? Is it an issue with the pretrained model or with the architecture itself? Thanks and regards