Closed napalm00 closed 4 years ago
Try to re-run the script, there is some randomness in prediction time too. This happens because attention sometimes is equal on multiple timesteps. Add/change the seed to get more control.
My mistake, I ran create_dataset.py
using the ljspeech_melgan_forward_transformer
configs instead of the (correct) ljspeech_melgan_autoregressive_transformer
configs.
Re-creating the dataset with the proper configs and running extract_durations worked properly.
Happens when running
python extract_durations.py --config ../ljspeech_melgan_autoregressive_transformer/melgan --binary --fix_jumps --fill_mode_next
on an autoregressive model trained to step 1,110,000 on a new dataset (restored from checkpoint 900k from the released model weights, commit 1c1cb03).
Also happens when using just the released 900k checkpoint with no training on the new dataset.
Any ideas what might be wrong? Does it need more training?