jrovu / LinguaFreq-text2speech

Creates voice audio files from a list of sentences, using AWS Polly (text-to-speech)
1 stars 0 forks source link

As a student, I should be able to listen to "pyramid"-style tongue-twisters, so that I can learn difficult language sentences. #25

Open jrovu opened 4 years ago

jrovu commented 4 years ago

Requirements / Concerns

Example: Pause + I'm going to the bank + Pause + I'm + Pause + I'm going + Pause X2 + I'm going to + Pause X3 + I'm going to the + Pause X 4 + I'm going to the bank + Pause X 5 + pause (just to end it)

Phrase Pyramid Audio - From one phrase, it creates Complete Phrase + Pause + Word1 + Pause + Word1 + Word 2 + PauseX2 + Word1 + Word 2 + Word 3 + PauseX3 + Word1 + Word 2 + Word 3 + Word 4 + PauseX4. As an end user, I'd like a listen & repeat MP3 that starts with the first word + a pause, and keeps adding the next word to "hold my hand" through a sentence.

jrovu commented 4 years ago

FYI @ctparadise I moved these comments from the XLS and created this feature issue.

jrovu commented 4 years ago

Made good progress on this today.

Accomplishments:

Next steps:

jrovu commented 4 years ago

This is the way to run the program in "pyramid" mode.

./tts.py -f pyramid_chinese.csv --foreign_voice Zhiyu --english_voice_engine neural --mode pyramid --speed 80

If you encounter a problem, run it in "verbose" mode, like: ./tts.py -f pyramid_chinese.csv --foreign_voice Zhiyu --english_voice_engine neural -v --mode pyramid --speed 80