yl4579 / StyleTTS

Official Implementation of StyleTTS
MIT License
396 stars 64 forks source link

Trying to have the inference demo work - but no success #25

Closed olup closed 1 year ago

olup commented 1 year ago

I am trying to have the inference demo output something intelligible in this collab.

But for now all my efforts are vain. I made sure to upsample the wav for prosody, but still getting rubbish. Could you point me where I am wrong ?

olup commented 1 year ago

I have been pointed to a bug in the notebook's code at this line

the start of the shell should be:

ps = global_phonemizer.phonemize([text])
# ps = word_tokenize(ps[0]) // this line is wrong
tokens = textclenaer(ps[0])
yl4579 commented 1 year ago

Sorry I made a mistake in the update. I have fixed it, can you try if it works now?

olup commented 1 year ago

Hey, thanks, it's effectively now working, and I'm closing my issue! But while i'm at it, would you have pointers on to have better end of sentences ? Model produces no significant pause between sentences and tend to have the wrong intonation at the end.