Open jrovu opened 4 years ago
FYI @ctparadise I moved these comments from the XLS and created this feature issue.
Made good progress on this today.
Accomplishments:
Next steps:
This is the way to run the program in "pyramid" mode.
./tts.py -f pyramid_chinese.csv --foreign_voice Zhiyu --english_voice_engine neural --mode pyramid --speed 80
If you encounter a problem, run it in "verbose" mode, like:
./tts.py -f pyramid_chinese.csv --foreign_voice Zhiyu --english_voice_engine neural -v --mode pyramid --speed 80
Requirements / Concerns
Example: Pause + I'm going to the bank + Pause + I'm + Pause + I'm going + Pause X2 + I'm going to + Pause X3 + I'm going to the + Pause X 4 + I'm going to the bank + Pause X 5 + pause (just to end it)
Phrase Pyramid Audio - From one phrase, it creates Complete Phrase + Pause + Word1 + Pause + Word1 + Word 2 + PauseX2 + Word1 + Word 2 + Word 3 + PauseX3 + Word1 + Word 2 + Word 3 + Word 4 + PauseX4. As an end user, I'd like a listen & repeat MP3 that starts with the first word + a pause, and keeps adding the next word to "hold my hand" through a sentence.