本地部署好不容易走到第9步卡住了，求救~

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Apache License 2.0

4.64k stars 697 forks source link

本地部署好不容易走到第9步卡住了，求救~ #482

Open Yao202303 opened 8 months ago

Yao202303 commented 8 months ago

python scripts/short_audio_transcribe.py --languages "{PRETRAINED_MODEL}" --whisper_size large

上面这部成功率，日语台词都识别出来了，276条语音显示processed。

运行python scripts/resample.py原本显示没找到sampled_audio4ft？然后我在根目录找到了台词文本.txt，就改名sampled_audio4ft.txt,然后再运行python scripts/resample.py就没任何反应了。跳过这个直接下一步又显示AssertionError: No audio file found. Please check your uploaded file structure. 求大佬赐教！脑壳痛，先睡觉了。。。

ihmily commented 7 months ago

其实你可以不执行这个，然后第10步的时候直接执行

python preprocess_v2.py --languages "{PRETRAINED_MODEL}"

如果你就想要使用这个辅助数据，可以在仓库releases找到inference.rar压缩包，里面有sampled_audio4ft这个压缩包，解压后其中包含sampled_audio4ft文件夹，将其复制到项目根目录

然后就可以正常执行第10步

python preprocess_v2.py --add_auxiliary_data True --languages "{PRETRAINED_MODEL}"

fesolla commented 1 month ago

其实你可以不执行这个，然后第10步的时候直接执行
python preprocess_v2.py --languages "{PRETRAINED_MODEL}"
如果你就想要使用这个辅助数据，可以在仓库releases找到inference.rar压缩包，里面有sampled_audio4ft这个压缩包，解压后其中包含sampled_audio4ft文件夹，将其复制到项目根目录

然后就可以正常执行第10步
python preprocess_v2.py --add_auxiliary_data True --languages "{PRETRAINED_MODEL}"

我去找了inference.rar，不過裡面沒有看到sampled_audio4ft壓縮包