NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
12.93k stars 3.12k forks source link

Seeking Help with Tacotron 2 Training for Telugu Language #1360

Open gujjulassr opened 8 months ago

gujjulassr commented 8 months ago

alignment_0 Hello everyone,

I hope this message finds you well. I'm currently working on training a Tacotron 2 model for the Telugu language, and I've encountered some challenges with alignment and output. I would greatly appreciate your expertise and guidance to help me address these issues.

Problem Description:

Issues:

  1. Alignment: I'm struggling to obtain accurate phoneme alignment during training, which is crucial for generating clear and coherent speech.
  2. Output: The generated Telugu speech is not understandable and appears to be random sounds, even though my training data is in Telugu.

Request for Help: I would appreciate any assistance or advice regarding the following:

I'm open to any insights, recommendations, or best practices that can help me improve the quality of my model's output. If you have experience with TTS in non-English languages, particularly Telugu, your expertise would be invaluable.

Thank you in advance for your time and support. I'm eager to learn and make progress on this project, and your guidance will be instrumental.

Feel free to ask for additional information or logs if needed. Your help is greatly appreciated.

Best regards, G samaram

### Tasks