yl4579 / PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
MIT License

2 Questions #32

Closed francqz31 closed 9 months ago

francqz31 commented 9 months ago

1- I wanted to add PL-BERT to https://github.com/huawei-noah/Speech-Backbones/tree/main/Grad-TTS. How can I do that? Which files should I modify?

2- I also wanted to ask if there is a better dataset than Wikipedia available anywhere. Thanks in advance!

yl4579 commented 9 months ago
  1. You will need to replace the text encoder at https://github.com/huawei-noah/Speech-Backbones/blob/main/Grad-TTS/model/text_encoder.py#L305 with the PL-BERT model.
  2. Since it is trained on text, any text corpus closely related to your downstream TTS datasets would be suitable. Because most publicly available TTS corpora are audiobook readings, Wikipedia is the best publicly available corpus for training, but you can definitely train on another text corpus if you know what you want for your downstream TTS tasks.
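
The swap in point 1 could be sketched roughly as below. This is only an illustration, not code from either repository: `PLBERTEncoder` is a hypothetical stand-in (a tiny `nn.TransformerEncoder` over phoneme IDs) for the actual pretrained PL-BERT, whose contextual phoneme embeddings would then feed Grad-TTS's prior/duration projections in place of the original `TextEncoder` output. The dimensions are made up for the example.

```python
import torch
import torch.nn as nn

class PLBERTEncoder(nn.Module):
    """Hypothetical stand-in for a pretrained PL-BERT phoneme encoder.

    In practice you would load the real pretrained PL-BERT checkpoint and
    return its hidden states; a small TransformerEncoder is used here only
    so the sketch is self-contained and runnable.
    """
    def __init__(self, n_phonemes=178, hidden=192, n_layers=2, n_heads=4):
        super().__init__()
        self.embed = nn.Embedding(n_phonemes, hidden)
        layer = nn.TransformerEncoderLayer(
            d_model=hidden, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)

    def forward(self, phoneme_ids):
        # (batch, seq) -> (batch, seq, hidden): contextual phoneme
        # embeddings, used where Grad-TTS's TextEncoder output went before
        return self.encoder(self.embed(phoneme_ids))

# Sketch of the integration point: Grad-TTS projects the encoder output to
# the mel prior (mu) and durations; those projections would now consume the
# PL-BERT hidden states instead.
encoder = PLBERTEncoder()
phonemes = torch.randint(0, 178, (2, 10))   # dummy batch of phoneme IDs
hidden_states = encoder(phonemes)           # shape (2, 10, 192)
```

Whether you fine-tune the PL-BERT weights jointly with Grad-TTS or freeze them is a design choice; the PL-BERT paper fine-tunes the encoder with the downstream TTS model.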