NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference
BSD 3-Clause "New" or "Revised" License
5.07k stars 1.38k forks source link

Tacotron lost some words when inference ? #452

Open leminhnguyen opened 3 years ago

leminhnguyen commented 3 years ago

Any suggestions? Thank in advance.

EuphoriaCelestial commented 3 years ago

I am facing the same error btw, which language are you training?

leminhnguyen commented 3 years ago

I am facing the same error btw, which language are you training?

Synthesized speech is usually not robust. Due to error propagation [3] and the wrong attention alignments between text and speech in the autoregressive generation, the generated mel-spectrogram is usually deficient with the problem of words skipping and repeating [19]