Gencaster / gencaster

A non-linear audio streaming framework for real-time radiophonic experiences and live music.
https://gencaster.org
GNU Affero General Public License v3.0

text 2 speech API #15

capital-G closed 2 years ago

capital-G commented 2 years ago

Implement a text-to-speech API in the backend. @vin-ni has a Node.js prototype, but we need this in Python. The API needs to accept a request and save the resulting audio as WAV (or MP3, OGG, ...) in the backend.

Must be async.
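
A minimal sketch of what "async" could look like here, assuming Python 3.9+ and plain asyncio. The names `save_speech`, `Synthesizer` and `fake_synthesize` are hypothetical and not part of the Gencaster code base; the real TTS call is replaced by a stub so the example runs on its own.

```python
import asyncio
from pathlib import Path
from typing import Awaitable, Callable

# Hypothetical type: any coroutine that turns text into encoded audio bytes.
Synthesizer = Callable[[str], Awaitable[bytes]]


def _write_file(path: Path, data: bytes) -> None:
    """Blocking helper: create the target directory and write the bytes."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(data)


async def save_speech(text: str, out_path: Path, synthesize: Synthesizer) -> Path:
    """Synthesize `text` and store the audio at `out_path`."""
    audio = await synthesize(text)
    # File I/O blocks, so run it in a worker thread to keep the event loop free.
    await asyncio.to_thread(_write_file, out_path, audio)
    return out_path


async def fake_synthesize(text: str) -> bytes:
    """Stand-in for a real TTS request; returns placeholder bytes, not audio."""
    await asyncio.sleep(0)
    return text.encode("utf-8")
```

Usage would be something like `await save_speech("Über", Path("cache/speech.wav"), fake_synthesize)`, with the stub swapped for whatever TTS client the backend ends up using.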

vin-ni commented 2 years ago

Use SSML directly:

<speak>
    Über
</speak>
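
For the Python side, a small sketch of wrapping plain text in a `<speak>` element like the one above. `to_ssml` is a hypothetical helper name; it only escapes XML special characters and does not validate SSML.

```python
from xml.sax.saxutils import escape


def to_ssml(text: str) -> str:
    """Wrap plain text in <speak>, escaping &, < and > so the markup stays valid."""
    return f"<speak>{escape(text)}</speak>"
```

Non-ASCII characters such as the "Ü" in "Über" pass through unchanged, so the file or request body should be encoded as UTF-8.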

Previous request in Node.js:

  // Construct the request
  const request = {
    input: { text: sentence },
    // Select the language and SSML voice gender (optional)
    voice: {
      languageCode: 'de-DE',
      ssmlGender: 'MALE',
      name: 'de-DE-Standard-B',
    },
    // Select the type of audio encoding.
    // Assuming Google Cloud TTS: it has no 'wav' value; LINEAR16 output includes a WAV header.
    audioConfig: { audioEncoding: 'LINEAR16' },
  };
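
A rough Python counterpart of the request above, as a sketch only. It assumes the prototype talks to Google Cloud Text-to-Speech (the thread does not say so explicitly) and mirrors the JSON body of that API's REST `text:synthesize` call. `build_tts_request` is a hypothetical helper; it takes SSML instead of plain text, following the earlier suggestion, and defaults to `LINEAR16`, which is that API's WAV-compatible encoding.

```python
import json


def build_tts_request(
    ssml: str,
    language_code: str = "de-DE",
    voice_name: str = "de-DE-Standard-B",
    ssml_gender: str = "MALE",
    audio_encoding: str = "LINEAR16",  # assumption: WAV-style output is wanted
) -> dict:
    """Build a JSON-serializable request body mirroring the Node.js prototype."""
    return {
        "input": {"ssml": ssml},
        "voice": {
            "languageCode": language_code,
            "name": voice_name,
            "ssmlGender": ssml_gender,
        },
        "audioConfig": {"audioEncoding": audio_encoding},
    }


request_body = build_tts_request("<speak>Über</speak>")
# ensure_ascii=False keeps "Ü" readable when the body is logged or sent as UTF-8.
payload = json.dumps(request_body, ensure_ascii=False)
```

Keeping the request a plain dict makes it easy to pass to an async HTTP client or to convert into the types of an official client library later.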