Yangyangii / TPGST-Tacotron

Google's TPGST reimplementation.

TPGST reimplementation in PyTorch

Paper: Predicting Expressive Speaking Style From Text in End-to-End Speech Synthesis

Prerequisite

Dataset

Samples

Usage

  1. Download the dataset above, set its path in config.py, and then run the command below.

    python prepro.py
  2. Train the model; it needs 100k+ training steps.

    python train.py <gpu_id>
  3. After training, you can synthesize speech from text.

    python synthesize.py <gpu_id> <model_path>
  4. To listen to your samples, you may need a mel2wav vocoder. A vocoder is not included in this repo.
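
The first three steps above can be chained from a small driver script. The following is only a sketch, not part of this repo: `build_commands` and `run_pipeline` are hypothetical helper names, and `path/to/checkpoint` is a placeholder, since where train.py writes its checkpoints is not stated here. It assumes the script is run from the repo root so that prepro.py, train.py, and synthesize.py are found.

```python
import subprocess
import sys


def build_commands(gpu_id, model_path):
    """Return the three usage commands as argument lists, in order.

    Hypothetical helper: it only mirrors the commands listed in Usage.
    """
    python = sys.executable  # reuse the interpreter running this script
    return [
        [python, "prepro.py"],
        [python, "train.py", str(gpu_id)],
        [python, "synthesize.py", str(gpu_id), model_path],
    ]


def run_pipeline(gpu_id, model_path, dry_run=True):
    """Print each command; also execute it when dry_run is False."""
    for cmd in build_commands(gpu_id, model_path):
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "path/to/checkpoint" is a placeholder, not a path from this repo
    run_pipeline(gpu_id=0, model_path="path/to/checkpoint", dry_run=True)
```

With `dry_run=True` the script only prints the commands, which is a cheap way to check the arguments before committing to a long training run. In practice training and synthesis are usually run separately, because the checkpoint path is only known once training has saved one.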

Notes