Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Apache License 2.0
4.69k stars 705 forks

Step 3 error: XDG_RUNTIME_DIR #466

Open 0ran opened 11 months ago

0ran commented 11 months ago

The files are named ranran_01.wav through ranran_66.wav, and each one is about 8 seconds long.

error: XDG_RUNTIME_DIR not set in the environment.
ALSA lib confmisc.c:855:(parse_card) cannot find card '0'
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_card_inum returned error: No such file or directory
ALSA lib confmisc.c:422:(snd_func_concat) error evaluating strings
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_concat returned error: No such file or directory
ALSA lib confmisc.c:1334:(snd_func_refer) error evaluating name
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_refer returned error: No such file or directory
ALSA lib conf.c:5701:(snd_config_expand) Evaluate error: No such file or directory
ALSA lib pcm.c:2664:(snd_pcm_open_noupdate) Unknown PCM default
ALSA lib confmisc.c:855:(parse_card) cannot find card '0'
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_card_inum returned error: No such file or directory
ALSA lib confmisc.c:422:(snd_func_concat) error evaluating strings
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_concat returned error: No such file or directory
ALSA lib confmisc.c:1334:(snd_func_refer) error evaluating name
ALSA lib conf.c:5178:(_snd_config_evaluate) function snd_func_refer returned error: No such file or directory
ALSA lib conf.c:5701:(snd_config_expand) Evaluate error: No such file or directory
ALSA lib pcm.c:2664:(snd_pcm_open_noupdate) Unknown PCM default
Warning: no long audios & videos found, this IS expected if you have only uploaded short audios
this IS NOT expected if you have uploaded any long audios, videos or video links. Please check your file structure or make sure your audio/video language is supported.
Warning: no short audios found, this IS expected if you have only uploaded long audios, videos or video links.
this IS NOT expected if you have uploaded a zip file of short audios. Please check your file structure or make sure your audio language is supported.
Lenubolim commented 11 months ago

I ran into the same problem...

Qianquqwq commented 10 months ago

Same here. Switching to different samples or a different model makes no difference.

Lenubolim commented 10 months ago

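> Same here. Switching to different samples or a different model makes no difference.

I solved this. In my case the samples were the problem; after re-recording them everything worked.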

DogeLord081 commented 10 months ago

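> > Same here. Switching to different samples or a different model makes no difference.
>
> I solved this. In my case the samples were the problem; after re-recording them everything worked.

What do you mean? How do I change the samples?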

Lu233 commented 10 months ago

Is there any official solution for this issue? I get the same error on Colab.

DogeLord081 commented 10 months ago

For anyone in the future, you can fix this issue by changing line 19 in scripts/short_audio_transcribe.py to:

mel = whisper.log_mel_spectrogram(audio, n_mels=128).to(model.device)
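
A note on why this probably works: as far as I know, the Whisper large-v3 checkpoint expects 128 mel bins while older checkpoints expect 80, so a spectrogram built with the default of 80 no longer matches the model. Instead of hardcoding 128, the value could be read from the loaded model. The sketch below assumes the installed openai-whisper exposes `model.dims.n_mels` and that `log_mel_spectrogram` accepts `n_mels`; `pick_n_mels` is a hypothetical helper, not part of whisper or of this repo, and the stand-in objects only imitate a loaded model.

```python
from types import SimpleNamespace


def pick_n_mels(model, default=80):
    """Return the mel-bin count the model reports, or `default` if it reports none.

    Hypothetical helper: it only reads `model.dims.n_mels` when that attribute exists.
    """
    dims = getattr(model, "dims", None)
    n_mels = getattr(dims, "n_mels", None)
    return default if n_mels is None else int(n_mels)


# Stand-ins for loaded models, so the helper can be tried without downloading weights.
older_model = SimpleNamespace(dims=SimpleNamespace(n_mels=80))
large_v3_like = SimpleNamespace(dims=SimpleNamespace(n_mels=128))

print(pick_n_mels(older_model))
print(pick_n_mels(large_v3_like))

# The patched line in scripts/short_audio_transcribe.py could then read (assumption):
# mel = whisper.log_mel_spectrogram(audio, n_mels=pick_n_mels(model)).to(model.device)
```

With this, the same script should keep working if a different Whisper model size is selected later, since the mel-bin count follows the model rather than a fixed number.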

deepKnight commented 8 months ago

Try this: https://github.com/pytorch/pytorch/issues/107960#issuecomment-1783432552