jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
https://jaywalnut310.github.io/vits-demo/index.html
MIT License
6.53k stars 1.22k forks source link

How to get duration of each phoneme. #69

Open tuannvhust opened 1 year ago

tuannvhust commented 1 year ago

We know that Stochastic Duration Prediction is used to synthesize speech with different pitches and rhythms. And how we can get duration of each phoneme.

wildBigPanda commented 6 months ago

Hi,I have the same problem on get duration of each phoneme,did you resolve it?