Hi how can learn model new words that don't see it.
How can learn model a new pronunciation and how can deal with crossover pronunciation of two neighbor words?
In HMM-based we can generate phones of a sentence and pass it to model, what about in this end to end model? I think that the network is too close.
Is it possible that record few new waves by new speaker and pass it to model to learn?
best regards
Hi how can learn model new words that don't see it. How can learn model a new pronunciation and how can deal with crossover pronunciation of two neighbor words? In HMM-based we can generate phones of a sentence and pass it to model, what about in this end to end model? I think that the network is too close. Is it possible that record few new waves by new speaker and pass it to model to learn? best regards