nii-yamagishilab / multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020
BSD 3-Clause "New" or "Revised" License
265 stars 41 forks source link

Feature generation for given text #8

Open chuber11 opened 3 years ago

chuber11 commented 3 years ago

Hi,

Great work you did. I have a question. Can you provide a script to extract features for a given text that can be used as input for the predictmel script (provided one has the phonemes from flite). I couldnt get it to work (normally i dont use tensorflow). I also wonder how to get from the phonemes to the token ids. Is that part of the feature extraction script?

Best, Christian