Open lyjgo opened 5 days ago
I met an error when I run the train_tacotron_ddc.py in TTS/recipes/ljspeech/tacotron2-DDC with the default config. The error and the config are as follows: ERROR
CONFIG audio_config = BaseAudioConfig( sample_rate=22050, do_trim_silence=True, trim_db=60.0, signal_norm=False, mel_fmin=0.0, mel_fmax=8000, spec_gain=1.0, log_func="np.log", ref_level_db=20, preemphasis=0.0, )
config = Tacotron2Config( # This is the config that is saved for the future use audio=audio_config, batch_size=64, eval_batch_size=16, num_loader_workers=4, num_eval_loader_workers=4, run_eval=True, test_delay_epochs=-1, r=6, gradual_training=[[0, 6, 64], [10000, 4, 32], [50000, 3, 32], [100000, 2, 32]], double_decoder_consistency=True, epochs=1000, text_cleaner="phoneme_cleaners", use_phonemes=True, phoneme_language="en-us", phoneme_cache_path=os.path.join(output_path, "phoneme_cache"), precompute_num_workers=8, print_step=25, print_eval=True, mixed_precision=False, output_path=output_path, datasets=[dataset_config], )
Is there anything I can do to solve this problem? Thanks
No response
"Packages": { "PyTorch_debug": false, "PyTorch_version": "2.2.0+cu118", "TTS": "0.22.0", "numpy": "1.22.0" }
full ERROR information:
Describe the bug
I met an error when I run the train_tacotron_ddc.py in TTS/recipes/ljspeech/tacotron2-DDC with the default config. The error and the config are as follows: ERROR![6208dece6097660c8fa1dc0b47c2daa](https://github.com/coqui-ai/TTS/assets/56882365/cf2c94cf-5ae0-4337-b5b2-b02db931fe35)
CONFIG audio_config = BaseAudioConfig( sample_rate=22050, do_trim_silence=True, trim_db=60.0, signal_norm=False, mel_fmin=0.0, mel_fmax=8000, spec_gain=1.0, log_func="np.log", ref_level_db=20, preemphasis=0.0, )
config = Tacotron2Config( # This is the config that is saved for the future use audio=audio_config, batch_size=64, eval_batch_size=16, num_loader_workers=4, num_eval_loader_workers=4, run_eval=True, test_delay_epochs=-1, r=6, gradual_training=[[0, 6, 64], [10000, 4, 32], [50000, 3, 32], [100000, 2, 32]], double_decoder_consistency=True, epochs=1000, text_cleaner="phoneme_cleaners", use_phonemes=True, phoneme_language="en-us", phoneme_cache_path=os.path.join(output_path, "phoneme_cache"), precompute_num_workers=8, print_step=25, print_eval=True, mixed_precision=False, output_path=output_path, datasets=[dataset_config], )
Is there anything I can do to solve this problem? Thanks
To Reproduce
Expected behavior
No response
Logs
No response
Environment
Additional context
No response