coqui-ai / TTS

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
http://coqui.ai
Mozilla Public License 2.0
31.76k stars 3.8k forks source link

[Feature request] Appropriate intonation using xtts_v2 und voice cloning #3573

Open Bardo-Konrad opened 4 months ago

Bardo-Konrad commented 4 months ago

πŸš€ Feature Description

Appropriate intonation using xtts_v2 und voice cloning

Solution

There is a certain structure to intonation that gives a natural flow, the same with using pauses. So the sentences spoken should also be analyzed for what a speaker intonates and when he uses pauses to adapt to new contexts semantically.

stale[bot] commented 3 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look our discussion channels.