p0p4k / pflowtts_pytorch

Unofficial implementation of NVIDIA P-Flow TTS paper
https://neurips.cc/virtual/2023/poster/69899
MIT License
198 stars 28 forks source link

requirements & phonemizer ? #1

Closed zidsi closed 8 months ago

zidsi commented 8 months ago

First of all - thank you for your effort and quick implementation.

In Instructions to run section of README you suggest

pip install -r requirements.txt

however it looks like requirements.txt is missing in repo. (not critical)

Additional Q about dataset preparation. Paper says: A G2P model [5] preprocesses the text into the International Phonetic Alphabet (IPA) format.

Do you train using IPA phonemes or straight chars?

p0p4k commented 8 months ago
zidsi commented 8 months ago

Tnx for info. I see phonemizer in text cleaners :( I was able to "skip" G2P step for VITS2 and it works good.