[Bug] RuntimeError: Error opening '/content/TTS/filelists/wavs/"yoon-349.wav': System error.

Welcome to the 🐸TTS project! We are excited to see your interest, and appreciate your support!

This repository is governed by the Contributor Covenant Code of Conduct. For more details, see the CODE_OF_CONDUCT.md file.

If you've found a bug, please provide the following information:

Describe the bug I am trying to use other dataset(Korean) and I don't know why this directory path includes "(quotation mark) in the middle with no reason during running. Check below error message and find the (") before yoon-349

RuntimeError: Error opening '/content/TTS/filelists/wavs/"yoon-349.wav': System error.

my metadata.csv does not include any quotation mark.

Do you have similar bugs reported?

To Reproduce Steps to reproduce the behavior:

My command is below

python TTS/bin/train_glow_tts.py \ --config_path "/content/drive/My Drive/Colab Notebooks/data/glowtts-v2/config.json" \ --coqpit.datasets.0.path "/content/TTS/filelists" \ --coqpit.audio.stats_path "/content/drive/My Drive/Colab Notebooks/data/glowtts-v2/scale_stats_new.npy" \ --coqpit.test_sentences_file "/content/TTS/test_sentences.txt" \ --coqpit.output_path "/content/drive/My Drive/Colab Notebooks/data/glowtts-v2/" \ --coqpit.num_loader_workers 2 \ --coqpit.num_val_loader_workers 2 \ --restore_path "/content/drive/My Drive/Colab Notebooks/data/glowtts-v2/model_file.pth.tar")

Expected behavior A clear and concise description of what you expected to happen.

Environment (please complete the following information):

OS Platform and Distribution (e.g., Linux Ubuntu 16.04):
PyTorch or TensorFlow version (use command below):
Python version:
CUDA/cuDNN version:
GPU model and memory:
Exact command to reproduce:

Additional context

This is my full error script

/content/TTS

Using CUDA: True Number of GPUs: 1 Git Hash: 3aa165a Experiment folder: /content/drive/My Drive/Colab Notebooks/data/glowtts-v2/glowtts-v2-December-06-2021_07+16PM-3aa165a Setting up Audio Processor... | > sample_rate:22050 | > resample:False | > num_mels:80 | > min_level_db:-100 | > frame_shift_ms:None | > frame_length_ms:None | > ref_level_db:20 | > fft_size:1024 | > power:1.1 | > preemphasis:0.98 | > griffin_lim_iters:60 | > signal_norm:True | > symmetric_norm:True | > mel_fmin:0 | > mel_fmax:8000.0 | > spec_gain:20.0 | > stft_pad_mode:reflect | > max_norm:4.0 | > clip_norm:True | > do_trim_silence:False | > trim_db:60 | > do_sound_norm:False | > stats_path:/content/drive/My Drive/Colab Notebooks/data/glowtts-v2/scale_stats_new.npy | > base:10 | > hop_length:256 | > win_length:1024 | > Found 850 files in /content/TTS/filelists Using model: glow_tts /content/TTS/TTS/tts/layers/glow_tts/glow.py:84: UserWarning: torch.qr is deprecated in favor of torch.linalg.qr and will be removed in a future PyTorch release. The boolean parameter 'some' has been replaced with a string parameter 'mode'. Q, R = torch.qr(A, some) should be replaced with Q, R = torch.linalg.qr(A, 'reduced' if some else 'complete') (Triggered internally at ../aten/src/ATen/native/BatchLinearAlgebra.cpp:1937.) w_init = torch.qr(torch.FloatTensor(self.num_splits, self.numsplits).normal())[0] Restoring from model_file.pth.tar ... Model restored from step 26000

Model has 28612177 parameters Starting with inf best loss.

DataLoader initialization | > Use phonemes: False | > Number of instances : 842 | > Max length sequence: 46 | > Min length sequence: 6 | > Avg length sequence: 19.327790973871736 | > Num. instances discarded by max-min (max=500, min=3) seq limits: 0 | > Batch group size: 128.

DataLoader initialization | > Use phonemes: False | > Number of instances : 8 | > Max length sequence: 31 | > Min length sequence: 18 | > Avg length sequence: 22.125 | > Num. instances discarded by max-min (max=500, min=3) seq limits: 0 | > Batch group size: 0. Data depended initialization ... /content/TTS/TTS/tts/models/glow_tts.py:291: UserWarning: floordiv is deprecated, and its behavior will change in a future version of pytorch. It currently rounds toward 0 (like the 'trunc' function NOT 'floor'). This results in incorrect rounding for negative values. To keep the current behavior, use torch.div(a, b, rounding_mode='trunc'), or for actual floor division, use torch.div(a, b, rounding_mode='floor'). y_lengths = (y_lengths // self.num_squeeze) * self.num_squeeze

EPOCH: 0/10000

TRAINING (2021-12-06 19:16:53) ! Run is removed from /content/drive/My Drive/Colab Notebooks/data/glowtts-v2/glowtts-v2-December-06-2021_07+16PM-3aa165a Traceback (most recent call last): File "TTS/bin/train_glow_tts.py", line 588, in main(args) File "TTS/bin/train_glow_tts.py", line 562, in main train_loader, model, criterion, optimizer, scheduler, ap, global_step, epoch File "TTS/bin/train_glow_tts.py", line 177, in train for num_iter, data in enumerate(data_loader): File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 521, in next data = self._next_data() File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 1203, in _next_data return self._process_data(data) File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/dataloader.py", line 1229, in _process_data data.reraise() File "/usr/local/lib/python3.7/dist-packages/torch/_utils.py", line 434, in reraise raise exception RuntimeError: Caught RuntimeError in DataLoader worker process 0. Original Traceback (most recent call last): File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/worker.py", line 287, in _worker_loop data = fetcher.fetch(index) File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/fetch.py", line 49, in fetch data = [self.dataset[idx] for idx in possibly_batched_index] File "/usr/local/lib/python3.7/dist-packages/torch/utils/data/_utils/fetch.py", line 49, in data = [self.dataset[idx] for idx in possibly_batched_index] File "/content/TTS/TTS/tts/datasets/TTSDataset.py", line 273, in getitem return self.load_data(idx) File "/content/TTS/TTS/tts/datasets/TTSDataset.py", line 142, in load_data wav = np.asarray(self.load_wav(wav_file), dtype=np.float32) File "/content/TTS/TTS/tts/datasets/TTSDataset.py", line 87, in load_wav audio = self.ap.load_wav(filename) File "/content/TTS/TTS/utils/audio.py", line 386, in load_wav x, sr = sf.read(filename) File "/usr/local/lib/python3.7/dist-packages/soundfile.py", line 257, in read subtype, endian, format, closefd) as f: File "/usr/local/lib/python3.7/dist-packages/soundfile.py", line 629, in init self._file = self._open(file, mode_int, closefd) File "/usr/local/lib/python3.7/dist-packages/soundfile.py", line 1184, in _open "Error opening {0!r}: ".format(self.name)) File "/usr/local/lib/python3.7/dist-packages/soundfile.py", line 1357, in _error_check raise RuntimeError(prefix + _ffi.string(err_str).decode('utf-8', 'replace')) RuntimeError: Error opening '/content/TTS/filelists/wavs/"yoon-349.wav': System error.

coqui-ai / TTS

[Bug] RuntimeError: Error opening '/content/TTS/filelists/wavs/"yoon-349.wav': System error. #999