m-toman / tacorn

2018/2019 TTS framework integrating state of the art open source methods
MIT License
47 stars 4 forks source link

Implementation plan #10

Closed Yeongtae closed 5 years ago

Yeongtae commented 5 years ago

Data feeding

m-toman commented 5 years ago

Thanks. I've updated to the latest version of Tacotron-2 and forked this version. Currently training the LJ taco model.

bjtommychen commented 5 years ago

Thanks you both. I am looking forward to your progress.

Yeongtae commented 5 years ago

Intro

WaveRNN use GRU operation optimization techniques. To figure out efficiency of operation optimization, I had completed the GRU opreation optimization experiment.

I had compared Pythorch's built-in GRU module with self implementation of GRU operation optimization

Experimental setup

I made a simple two-dimensional coordinate prediction problem, f(xi) = y = x(i+1), The dataset was generated by using r x rotationmatrix(theta).[1, 0 ]^T. image Device: k80 GPU

Implementation

Built-in GRU module image Self implementation of original GRU image Self implementation of GRU operation optimization image

Experimental result

I performed full batch inferences, 10000 times each.

Built-in GRU module: 6.02 seconds Self implementation of original GRU: 16.0 seconds Self implementation of GRU operation optimization: 9.70 seconds

The pytorch build-in GRU module is well optimized. Because it use cudnn_gru, image check https://github.com/rossumai/OCkRE/issues/2

Conclusion

In my opinion, it's faster to use built-in GRU in pytorch.

m-toman commented 5 years ago

Thanks for the experiment. Unfortunately I'm in hospital and couldn't continue to integrate the original wavernn implementation for now, but it's still on my agenda.

Yeongtae commented 5 years ago

I'm trying to use faster wave generator, which is waveglow. Sorry about stopping implementation.

m-toman commented 5 years ago

Perhaps this post is interesting for you: https://github.com/mozilla/TTS/issues/9#issuecomment-440976514 I'm generally still watching the Mozilla TTS repo to check their progress with WaveRNN.

m-toman commented 5 years ago

I'll close this for now.