r9y9 / tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.
http://nbviewer.jupyter.org/github/r9y9/tacotron_pytorch/blob/master/notebooks/Test%20Tacotron.ipynb
Other
306 stars 79 forks source link

why is_end_of_frames can detection the end frame in test phase? #15

Closed wangjunchao1118 closed 5 years ago

wangjunchao1118 commented 6 years ago

thanks for your code. I have a question about the tacotron_pytorch/tacotron.py line 274. why output.data <= 0.2 is the end frame in test phase. if i use this funtion, i only can decode 2 step in test time.

def is_end_of_frames(output, eps=0.2): return (output.data <= eps).all()

r9y9 commented 6 years ago

This is the heuristic I found from my experiments. It may not work for you depending on your experimental settings. If it does not work then you can try your own method.

PetrochukM commented 6 years ago

You can train a scalar on your test data to fit this parameter!

wangjunchao1118 commented 6 years ago

thanks for your reply. I use blizzard 2011 nancy data to train tacotron. In my test , i can't find a good eps value for my test sentence. so when i decode a sentence , i decode T_encoder times in test phase. it seems work well.

stale[bot] commented 5 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.