elevenlabs / elevenlabs-docs

Documentation for elevenlabs.io/docs
https://elevenlabs.io/docs

Tonal languages have major mispronunciations #370

Open joseph2mi opened 3 months ago

joseph2mi commented 3 months ago

Path: /speech-synthesis/prompting

Unlike in English, you cannot prompt "correct" pronunciations in tonal languages like Chinese or Vietnamese. No matter how close the voices may sound, Microsoft Azure voices will still beat ElevenLabs for the simple reason that the speech ElevenLabs generates often gets tonal markers wrong where Azure's does not. This is an important part of the training base that needs to be fine-tuned.

louisjoecodes commented 1 month ago

Hi @joseph2mi, have you tried our pronunciations feature? Could you give us some examples? Thanks!