Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Apache License 2.0
4.65k stars 698 forks source link

vits推理的时候怎么得到每个音素的时长? #570

Open wildBigPanda opened 6 months ago

wildBigPanda commented 6 months ago

vits推理的时候怎么得到每个音素的时长?或者有没有什么办法可以得到文本对应的时间戳,有没有大神知道?