Duration loss stuck at 0 when use_energy_predictor is turned off

roedoejet / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

MIT License

22 stars 7 forks source link

Just moving our email conversation to GitHub so that everyone can see the issue (and hopefully the fix!).

When running the train.py script, the output log files result in a duration loss of 0 when the use_energy_predictor config in model.yaml is set to false.

I am using the LJSpeech dataset provided here https://keithito.com/LJ-Speech-Dataset/ and the TextGrids provided here https://drive.google.com/drive/folders/1DBRkALpPd6FL9gjHMmMEdHODmkgNIIK4, retrieved from the README file.

This also affects any new languages trained as long as the use_energy_predictor config is set to false. (but recommended in your paper since you find it better for low resourced languages).

roedoejet / FastSpeech2

Duration loss stuck at 0 when use_energy_predictor is turned off #6