cmusphinx / g2p-seq2seq

G2P with Tensorflow
Other
670 stars 194 forks source link

Beamsearch for decoding instead of the current greedy search. #54

Closed bmilde closed 6 years ago

bmilde commented 8 years ago

Decoding is currently done with a greedy max for each decoding step. This will (severely) limit the performance of the model when you use it, as only the probability of the next phoneme character is maximized and not the probability of the whole phoneme sequence. Using beam search, a better log probability for the whole phoneme sequence could be found. I suggest to implement beam search by changing Tensorflows seq2seq decoding so that the decoding can be done step wise and then the beam search can be handled in Python.

This Tensorflow issue is related https://github.com/tensorflow/tensorflow/issues/654, some comments shared code of extended implementations with beam search.

nurtas-m commented 6 years ago

Solved in new versions of g2p-seq2seq starting from 6.0.0a0.