olofson / audiality2

A realtime scripted modular audio engine for video games and musical applications.
http://audiality.org/
zlib License

Speech synthesis #67

Open olofson opened 10 years ago

olofson commented 10 years ago

While this should just be a typical application for Audiality 2, it would make a nice and useful example to include with the engine, and also a nice pilot project for the scripting engine and the API. Lip sync and related needs might also call for features that would be useful elsewhere in the engine.

olofson commented 9 years ago

Design idea... Three layers:

  1. Vocal tract synthesizer program. A program that starts in a silent state, optionally with a number of init arguments, and then responds to a standardized set of messages that control pitch and formants, trigger plosives etc. (see the first sketch after this list). These programs would essentially be designed like (and also be usable as) musical instruments. There is not necessarily much that is speech synthesis specific about them, apart from the timbres (typically) being more or less humanoid.
    • We could even split this up further, allowing a vocal tract synth to be constructed from separate programs for vocal cords, different resonances, different plosive generators etc.
  2. Speech modulator program. An intermediate level that defines how phonemes are actually pronounced. Basically an interactive sequencer that is driven by phoneme messages and sends control messages to vocal tract synthesizers (see the second sketch below). We'll probably need some new datatypes and A2S constructs to implement this sensibly, if we are to do it in A2S at all.
  3. Dictionary, phrasing etc. This is probably best left to external libs, or even left out altogether, if we only need to deal with pre-composed words and messages.
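
To make layer 1 concrete, here is a minimal C sketch of what the "standardized set of messages" for a vocal tract synthesizer could look like. All names and fields (`VT_Control`, `VT_Message`, etc.) are hypothetical illustrations, not part of the Audiality 2 API; in practice the message set would be defined by the A2S programs themselves and delivered through the engine's normal message mechanism.

```c
/* Hypothetical sketch of the "standardized set of messages" a layer-1
 * vocal tract synthesizer program could respond to. Names are
 * illustrative only; the real set would be defined by the A2S programs
 * and sent through Audiality 2's normal message mechanism. */

typedef enum {
	VT_PITCH,	/* vocal cord fundamental (linear pitch) */
	VT_FORMANT1,	/* first resonance center frequency (Hz) */
	VT_FORMANT2,	/* second resonance center frequency (Hz) */
	VT_FORMANT3,	/* third resonance center frequency (Hz) */
	VT_BREATH,	/* noise/breathiness mix (0..1) */
	VT_PLOSIVE	/* trigger a plosive burst (p, t, k, ...) */
} VT_Control;

typedef struct {
	unsigned	when;	/* timestamp (audio frames) */
	VT_Control	ctl;	/* which control/message */
	float		value;	/* target value, or burst level for VT_PLOSIVE */
	float		ramp;	/* ramp time for continuous controls (ms) */
} VT_Message;
```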
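And a correspondingly hedged sketch of layer 2: a phoneme table plus a function that translates one phoneme into the control messages a layer-1 synth would receive, reusing the `VT_Message` type from the sketch above. The formant values and names are rough placeholders; a real implementation would likely live in A2S and use the engine's own message timing.

```c
#include <stddef.h>

/* Illustrative phoneme description; real data would be far richer. */
typedef struct {
	const char *symbol;	/* phoneme symbol, e.g. "a", "i", "u" */
	float f1, f2, f3;	/* formant targets (Hz) */
	float breath;		/* 0 = fully voiced, 1 = fully unvoiced */
	float duration_ms;	/* nominal steady-state length */
} Phoneme;

/* A few rough vowel targets, purely as placeholder data. */
static const Phoneme phonemes[] = {
	{ "a", 800.0f, 1200.0f, 2500.0f, 0.05f, 120.0f },
	{ "i", 290.0f, 2250.0f, 2900.0f, 0.05f, 100.0f },
	{ "u", 320.0f,  870.0f, 2250.0f, 0.05f, 100.0f }
};

/* Turn one phoneme into control messages for a layer-1 synth, ramping
 * the formants over 'transition_ms' so adjacent phonemes glide into
 * each other. Returns the number of messages written to 'out'. */
static size_t phoneme_to_messages(const Phoneme *p, unsigned when,
		float transition_ms, VT_Message *out, size_t max)
{
	size_t n = 0;
	if(max < 4)
		return 0;
	out[n++] = (VT_Message){ when, VT_FORMANT1, p->f1, transition_ms };
	out[n++] = (VT_Message){ when, VT_FORMANT2, p->f2, transition_ms };
	out[n++] = (VT_Message){ when, VT_FORMANT3, p->f3, transition_ms };
	out[n++] = (VT_Message){ when, VT_BREATH, p->breath, transition_ms };
	return n;
}
```

The `transition_ms` ramp is a crude stand-in for coarticulation: adjacent phonemes glide into each other instead of jumping, and the same ramped control data is roughly what lip sync information could be derived from.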