jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
https://jaywalnut310.github.io/vits-demo/index.html
MIT License
6.72k stars 1.23k forks source link

Runtime error #162

Closed YTRemRem closed 1 year ago

YTRemRem commented 1 year ago

-- Process 0 terminated with the following error: Traceback (most recent call last): File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 69, in _wrap fn(i, *args) File "F:\AI Voice (VITS - SoVITS)\vits orig\train.py", line 117, in run train_and_evaluate(rank, epoch, hps, [net_g, net_d], [optim_g, optim_d], [scheduler_g, scheduler_d], scaler, [train_loader, eval_loader], logger, [writer, writer_eval]) File "F:\AI Voice (VITS - SoVITS)\vits orig\train.py", line 137, in train_and_evaluate for batch_idx, (x, x_lengths, spec, spec_lengths, y, y_lengths) in enumerate(train_loader): File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 633, in next data = self._next_data() File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1345, in _next_data return self._process_data(data) File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1371, in _process_data data.reraise() File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch_utils.py", line 644, in reraise raise exception RuntimeError: Caught RuntimeError in DataLoader worker process 0. Original Traceback (most recent call last): File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data_utils\worker.py", line 308, in _worker_loop data = fetcher.fetch(index) File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data_utils\fetch.py", line 51, in fetch data = [self.dataset[idx] for idx in possibly_batched_index] File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\utils\data_utils\fetch.py", line 51, in data = [self.dataset[idx] for idx in possibly_batched_index] File "F:\AI Voice (VITS - SoVITS)\vits orig\data_utils.py", line 94, in getitem return self.get_audio_text_pair(self.audiopaths_and_text[index]) File "F:\AI Voice (VITS - SoVITS)\vits orig\data_utils.py", line 62, in get_audio_text_pair spec, wav = self.get_audio(audiopath) File "F:\AI Voice (VITS - SoVITS)\vits orig\data_utils.py", line 76, in get_audio spec = spectrogram_torch(audio_norm, self.filter_length, File "F:\AI Voice (VITS - SoVITS)\vits orig\mel_processing.py", line 66, in spectrogram_torch spec = torch.stft(y, n_fft, hop_length=hop_size, win_length=win_size, window=hann_window[wnsize_dtype_device], File "C:\Users\rem\anaconda3\envs\vits\lib\site-packages\torch\functional.py", line 641, in stft return _VF.stft(input, n_fft, hop_length, win_length, window, # type: ignore[attr-defined] RuntimeError: stft requires the return_complex parameter be given for real inputs, and will further require that return_complex=True in a future PyTorch release.

ZodiacFRA commented 3 weeks ago

I'm getting the same error, how did you fix it @YTRemRem ?

YTRemRem commented 3 weeks ago

It's been a while, I have forgotten how to solve it, sorry for that.