Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Apache License 2.0
4.7k stars 705 forks source link

明明上传了文件,结果google colab的步骤3.5出现了问题 #571

Open wodhei opened 8 months ago

wodhei commented 8 months ago

AssertionError Traceback (most recent call last) /content/VITS-fast-fine-tuning/preprocess_v2.py in 34 if speaker not in speakers: 35 speakers.append(speaker) ---> 36 assert (len(speakers) != 0), "No audio file found. Please check your uploaded file structure." 37 # Source 3 (Optional): sampled audios as extra training helpers 38 if args.add_auxiliary_data:

AssertionError: No audio file found. Please check your uploaded file structure.

wodhei commented 8 months ago

跟这两个错误有关吗,这是执行1步骤的时候发生的错误。 202402 202403

wodhei commented 8 months ago

以及这些 202404

AI-Satori commented 8 months ago

昨天遇到了同样的问题

fesolla commented 4 months ago

同樣的問題,用雲端硬碟還是本地檔案都不行

YongXie66 commented 4 months ago

同樣的問題,用雲端硬碟還是本地檔案都不行

相同问题,请问你解决了吗,我发现是检测spoken language时报的错 image

fesolla commented 4 months ago

同樣的問題,用雲端硬碟還是本地檔案都不行

相同问题,请问你解决了吗,我发现是检测spoken language时报的错 image 我之前換另外一個google帳號,莫名其妙就好了

不過現在我改在本地端安裝,又遇到一樣的問題。 AssertionError: No audio file found. Please check your uploaded file structure.

YongXie66 commented 4 months ago

同樣的問題,用雲端硬碟還是本地檔案都不行

相同问题,请问你解决了吗,我发现是检测spoken language时报的错 image 我之前換另外一個google帳號,莫名其妙就好了

不過現在我改在本地端安裝,又遇到一樣的問題。 AssertionError: No audio file found. Please check your uploaded file structure.

我解决了,当你进行short_audio_transcribe.py时,把whisper_size换成medium,就不会有维度不匹配的报错了!但是我还没弄懂为什么会这样~