Hi - I tried your alternate model, and it worked well with little effort, so thank you for your work.
However, I noticed that the output of your melspectrogram() function often clips to 1.0 on LJSpeech data.
(Of course, this might be a mistake in my own implementation.)
The code also looks similar to keithito/tacotron. In Keith's version, he later changed one line to
S = _amp_to_db(_linear_to_mel(np.abs(D))) - hparams.ref_level_db
in response to an issue raised by Rafael Valle. I wonder whether this difference was intentional (or perhaps it is not relevant).
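To illustrate what I mean, here is a minimal sketch of how the missing subtraction could cause the clipping. It assumes the normalization step has the form clip((S - min_level_db) / -min_level_db, 0, 1), with min_level_db = -100 and ref_level_db = 20. Those values, and the helper names below, are my assumptions for illustration and may not match your hparams. It uses plain Python scalars rather than NumPy arrays so that it runs without dependencies.

```python
import math

# Assumed values, for illustration only; check them against your own hparams.
MIN_LEVEL_DB = -100.0
REF_LEVEL_DB = 20.0


def amp_to_db(x):
    """Convert a linear amplitude to decibels, floored at 1e-5 to avoid log(0)."""
    return 20.0 * math.log10(max(1e-5, x))


def normalize(s_db):
    """Map [MIN_LEVEL_DB, 0] dB onto [0, 1], clipping anything outside that range."""
    return min(1.0, max(0.0, (s_db - MIN_LEVEL_DB) / -MIN_LEVEL_DB))


def mel_value(amplitude, subtract_ref):
    """Normalized value for one mel bin, with or without the ref_level_db offset."""
    s_db = amp_to_db(amplitude)
    if subtract_ref:
        s_db -= REF_LEVEL_DB
    return normalize(s_db)


if __name__ == "__main__":
    # Hypothetical mel-bin amplitudes: quiet, unity, and loud.
    for amp in (0.01, 1.0, 5.0):
        print(amp, mel_value(amp, False), mel_value(amp, True))
```

Under these assumptions, any mel amplitude at or above 1.0 (0 dB) reaches the upper bound of 1.0 when ref_level_db is not subtracted, while the subtraction leaves 20 dB of headroom before clipping starts.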
Thanks.