I am a newbie on attention based model.
I want to make an phonetic recognition in your library, will it work in practice ?
Suppose I have 10000 training data, consist different size of frames, each frames is 39 dimensional.
Could you give me some advices for implement these system ?
I am a newbie on attention based model. I want to make an phonetic recognition in your library, will it work in practice ? Suppose I have 10000 training data, consist different size of frames, each frames is 39 dimensional.
Could you give me some advices for implement these system ?