astricks closed this issue 3 years ago
Do you have tensorboard outputs for spectrograms?
Please find the tensorflow events file at the link below. Please let me know if I can provide any more information.
https://drive.google.com/drive/folders/1nlpz6uPVaSLqfP61zWF_Po1XFuex4C5z?usp=sharing
@astricks Hey, I've had a look at your logs and the spectrograms seem broken. Are you sure that your dataset is okay?
One suggestion: You could try setting "spec_gain": 20. With "spec_gain": 1 and no stats file (see stats_path), people have had similar problems with the spectrograms in the past.
One question: Is there a reason you set "min_seq_len": 81? Seems a bit high, but I'm not sure in which way that might affect the model.
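To illustrate why a low spec_gain might flatten the spectrograms, here is a minimal sketch. It assumes the amplitude-to-dB conversion is `spec_gain * log10(max(1e-5, amplitude))` and that values are then min-max normalized against a `min_level_db` of -100. Both the formula and the -100 default are assumptions made for illustration, not taken from the config in this thread, and the function names are hypothetical.

```python
import math


def amp_to_db(amplitude, spec_gain, floor=1e-5):
    # Assumed conversion: gain * log10 of the amplitude, floored to avoid log(0).
    return spec_gain * math.log10(max(floor, amplitude))


def normalize(db_value, min_level_db=-100.0):
    # Assumed min-max normalization to [0, 1], clipped at both ends.
    scaled = (db_value - min_level_db) / -min_level_db
    return min(1.0, max(0.0, scaled))


def normalized_range(spec_gain, amplitudes=(1e-5, 1.0)):
    # Smallest and largest normalized value over the given amplitudes.
    values = [normalize(amp_to_db(a, spec_gain)) for a in amplitudes]
    return min(values), max(values)


if __name__ == "__main__":
    for gain in (1, 20):
        low, high = normalized_range(gain)
        print(f"spec_gain={gain}: normalized range {low:.2f} to {high:.2f}")
```

Under these assumptions, spec_gain=1 squeezes every value into roughly 0.95 to 1.0, while spec_gain=20 spreads the same amplitudes over the full 0.0 to 1.0 range.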
@SanjaESC I came up with min_seq_len=81 by sorting the sentences by length and using awk to calculate the lengths. Below is the shortest sentence. Passing it through phonemize gives length 82, so I figured I'd go with the shorter one. Do these have to be exact numbers?
देशवासियों आप सब को नमस्कार 2015।
I'll try increasing spec_gain to 20 and running again.
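For reference, the length check described above could also be done in Python instead of awk. This is only a sketch: `length_stats` is a hypothetical helper, and it counts Unicode code points, which may differ from what awk reports (depending on locale) and from the phonemized length.

```python
import statistics


def length_stats(sentences):
    # Character counts (Unicode code points) of the non-empty sentences.
    lengths = [len(s.strip()) for s in sentences if s.strip()]
    if not lengths:
        raise ValueError("no non-empty sentences given")
    return {
        "min": min(lengths),
        "max": max(lengths),
        "median": statistics.median(lengths),
    }


if __name__ == "__main__":
    sample = ["देशवासियों आप सब को नमस्कार 2015।", "a slightly longer example sentence"]
    print(length_stats(sample))
```

Looking at the median as well as the extremes gives a rough idea of how much data a given min/max pair would exclude.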
They don't have to be exact numbers. These are adjustable options that let you decide which data to use. In your case, you are using data with a text length between min. 82 and max. 186 characters. When you start training, it should report at the beginning how many files were filtered out based on your min/max settings.
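The filtering described here can be sketched roughly as follows. This is not the trainer's actual code: `filter_by_length` is a hypothetical helper, the bounds are assumed to be inclusive, and length is assumed to be the character count of the text.

```python
def filter_by_length(items, min_seq_len, max_seq_len):
    # items: list of (text, wav_path) pairs; wav paths here are placeholders.
    kept, dropped = [], []
    for text, wav_path in items:
        if min_seq_len <= len(text) <= max_seq_len:
            kept.append((text, wav_path))
        else:
            dropped.append((text, wav_path))
    return kept, dropped


if __name__ == "__main__":
    data = [("short", "a.wav"), ("x" * 100, "b.wav"), ("y" * 300, "c.wav")]
    kept, dropped = filter_by_length(data, min_seq_len=81, max_seq_len=186)
    print(f"kept {len(kept)} items, filtered out {len(dropped)}")
```

A high minimum such as 81 silently drops every shorter utterance, so it is worth comparing the filtered count against the dataset size before training.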
I fixed the min/max lengths (6 and 280) and set spec_gain to 20, and it's working fine! Thanks for all the help. I'll share the Hindi model once it's trained. Closing the issue.
Hi,
I am trying to train a Tacotron2 model in Hindi. I have my own cleaned, 25-hour, single-speaker dataset. I'm using the following configuration.
{ "model": "Tacotron2", "run_name": "hindi-ddc", "run_description": "tacotron2 with DDC and differential spectral loss.",
}
--
The stacktrace I'm hitting is below.
--
I've been trying to debug this for 2 days but haven't been able to make progress. I'd really appreciate any help or suggestions.