Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Apache License 2.0
4.69k stars 703 forks source link

final_annotation.txt 内容是这样的 #517

Closed scriptboy1990 closed 9 months ago

scriptboy1990 commented 9 months ago
image

./sampled_audio4ft/415.wav|1|həNN↓↑ʧ⁼you↓↑mei↑ you↓↑ tʰa↓s`aNg↓ fəNg→ t⁼ə k⁼wo↑tʰu↓↑ lə…

但是我记得上次生成的是像日文那样的拼音呐,不知道哪一步错了。

scriptboy1990 commented 9 months ago

chinese_cleaners不知道什么时候变成了zh_ja_mixture_cleaners