ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
MIT License
1.69k stars 515 forks

Duration of synthesis output is very short #219

Open hplanmuc opened 7 months ago

hplanmuc commented 7 months ago

I trained the model on LJSpeech following the tutorial provided in the repository and ran into this problem during training. As shown below, the left side is the result from the pretrained model and the right side is the result from my own training. Has anyone encountered this before?

Before this problem, I also ran into a problem like the one in https://github.com/ming024/FastSpeech2/issues/105, so I commented out "model = nn.DataParallel(model)" in train.py and changed " torch.save( { "model": model.module.state_dict(), "optimizer": optimizer._optimizer.state_dict(), } " to " torch.save( { "model": model.state_dict(), "optimizer": optimizer._optimizer.state_dict(), } "

Screenshot 2023-12-05 103744 Screenshot 2023-12-01 105134