Closed MahirEusufzai closed 7 years ago
There is no exact mapping in seq2seq, but the attention weights shift from the start of the input to the end as decoding proceeds, so by inspecting them you can see roughly which chunk of the input string each phoneme was derived from. You have to modify the seq2seq code to retrieve them, though. For details, see:
https://stackoverflow.com/questions/40601552/visualizing-attention-activation-in-tensorflow
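Once you have the weights, turning them into a rough letter-to-phoneme alignment could look something like the sketch below. It assumes you end up with a matrix with one row per output phoneme and one column per input letter. The function name `align_phonemes` and the numbers in `attention` are made up for illustration; they are not real model output or part of the seq2seq code. Each letter is simply assigned to the phoneme that puts the most weight on it, so treat the result as a soft hint, not an exact mapping.

```python
def align_phonemes(word, phonemes, attention):
    """Assign each letter to the phoneme that attends to it most strongly.

    attention[i][j] is assumed to be the weight that output phoneme i
    places on input letter j (hypothetical layout, see note above).
    Returns a list of (phoneme, letters) pairs in phoneme order.
    """
    chunks = ["" for _ in phonemes]
    for j, letter in enumerate(word):
        # Index of the phoneme with the largest weight on this letter.
        best = max(range(len(phonemes)), key=lambda i: attention[i][j])
        chunks[best] += letter
    return list(zip(phonemes, chunks))


# Made-up weights for illustration only; each row sums to 1.
word = "phone"
phonemes = ["F", "OW", "N"]
attention = [
    [0.50, 0.40, 0.05, 0.03, 0.02],  # F
    [0.05, 0.10, 0.70, 0.05, 0.10],  # OW
    [0.02, 0.03, 0.05, 0.60, 0.30],  # N
]

print(align_phonemes(word, phonemes, attention))
# [('F', 'ph'), ('OW', 'o'), ('N', 'ne')]
```

With real weights the peaks are usually less clean than this, and a phoneme can end up with no letters or with non-adjacent ones, so some smoothing or a monotonicity constraint may be needed.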
Is it feasible to show which phonemes represent which segment of each word?
For example, instead of the output being
it would be