xcmyz / FastSpeech

The Implementation of FastSpeech based on pytorch.
MIT License
858 stars 213 forks

duration loss calculated in log domain or linear domain #94

Open MorganCZY opened 3 years ago

MorganCZY commented 3 years ago

I notice that the original implementation of FastSpeech (integrated in ESPnet) calculates the duration loss in the log domain, which means the logarithm of the target duration is taken first. In your version, the duration loss is calculated directly in the linear domain. Do you have any thoughts on the two methods?
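To make the question concrete, here is a minimal, framework-free sketch of the two variants as I understand them. It is not code from either repository: the function names, the `offset` of 1.0 added before the logarithm (to avoid `log(0)` for zero-length tokens), and the toy durations are all my own assumptions for illustration.

```python
import math


def linear_duration_loss(predicted, target):
    """Mean squared error between predicted and target durations (in frames)."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(target)


def log_duration_loss(predicted_log, target, offset=1.0):
    """Mean squared error in the log domain.

    The predictor is assumed to output log-durations directly; the targets
    are converted with log(t + offset). The offset is an assumption here,
    chosen so that a zero-length target does not produce log(0).
    """
    log_target = [math.log(t + offset) for t in target]
    return sum((p - lt) ** 2 for p, lt in zip(predicted_log, log_target)) / len(target)


# Toy example (hypothetical numbers): a short token and a long token,
# both over-predicted by 50%.
target = [2.0, 40.0]
predicted = [3.0, 60.0]

# Linear domain: the long token contributes 400 of the 401 total squared error.
linear = linear_duration_loss(predicted, target)

# Log domain: the same predictions expressed as log(d + 1).
predicted_log = [math.log(p + 1.0) for p in predicted]
log_domain = log_duration_loss(predicted_log, target)

print(f"linear-domain loss: {linear:.4f}")
print(f"log-domain loss:    {log_domain:.4f}")
```

In this toy case the linear-domain loss is dominated by the long token, while in the log domain the two tokens contribute errors of similar size, since the same relative error gives a similar log difference. Whether that matters in practice for this repository is exactly what I am asking about.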