Unlike English, you cannot prompt "correct" pronunciations on languages like Chinese or Vietnamese. No matter how close the voices may sound, Microsoft Azure voices will still beat Eleven labs for the sole fact that the speech generated often messes up tonal markers when Azure doesn't. This is an important training base that needs to be fine-tuned.
Path: /speech-synthesis/prompting
Unlike English, you cannot prompt "correct" pronunciations on languages like Chinese or Vietnamese. No matter how close the voices may sound, Microsoft Azure voices will still beat Eleven labs for the sole fact that the speech generated often messes up tonal markers when Azure doesn't. This is an important training base that needs to be fine-tuned.