souzatharsis / podcastfy

Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
https://www.podcastfy.ai

Integrate with Google TTS #23

Open souzatharsis opened 5 days ago

souzatharsis commented 5 days ago

The advantages there are broader SSML support and lower cost.

Suggested by Gilgamesh from NotebookLM Discord.

brumar commented 9 hours ago

SSML is definitely interesting. Azure Speech Services supports it too. But in the context of a generative AI project, do we expect the user to edit the transcript to add this markup, or is there a way to generate it, or to enrich an existing transcript with it? Does anyone have success stories about leveraging gen AI this way?

Anyway, that was more of a sidetrack than a really meaningful comment. The more TTS engines we have, the better :)
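
For what it's worth, on the question of enriching an existing transcript: here is a minimal sketch of the non-generative baseline, assuming we only want sentence boundaries and fixed pauses. The `to_ssml` helper and its `pause_ms` parameter are hypothetical names, not part of podcastfy or of any TTS SDK. It only uses the standard SSML `<speak>`, `<s>` and `<break>` elements, and it is rule-based, so it says nothing about how well an LLM could do the same job.

```python
import re
from xml.sax.saxutils import escape


def to_ssml(transcript: str, pause_ms: int = 400) -> str:
    """Wrap a plain-text transcript in basic SSML (hypothetical helper)."""
    # Split on whitespace that follows sentence-ending punctuation.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", transcript.strip()) if s]
    # Escape &, < and > so the transcript text cannot break the XML markup.
    marked = [f"<s>{escape(s)}</s>" for s in sentences]
    # Insert a fixed pause between consecutive sentences.
    pause = f'<break time="{pause_ms}ms"/>'
    return "<speak>" + pause.join(marked) + "</speak>"


if __name__ == "__main__":
    print(to_ssml("Welcome back! Today: R&D at scale."))
```

The resulting string could then be passed to whichever TTS backend accepts SSML input. A generative approach would replace the fixed rules with a prompt asking the model to place the tags, but the output would still need escaping and validation along these lines.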