Integration with ElevenlabsAPI: Voices sound different over API than on website

inclusion-international / speech-jokey

A speech synthesis software with integration of several TTS APIs, SSML support and optimizations for users with motor impairment. (course ASSIST HEIDI WS2023 and SS2024)

MIT License

My initial assumption is that APIs can indeed change, and that they can behave differently from what the provider offers on their website.

You need to keep in mind that the ElevenLabs Machine Learning Model is HEAVILY influenced by what is actually written in the text.

It is a generative model, so its output is not very deterministic; the trade-off is potentially much better and more refined output.

If, for example, you write a text containing 'Speak this in a brutal and very manly way', the AI will be inclined to do so, even if the chosen voice is female.

Nuances in the writing can also have an impact in that regard.

If you run into things like that, it's always best to provide the inputs that gave you those results; otherwise it is nearly impossible for me to reproduce them.
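To make such a report easier, here is a minimal sketch of how the exact inputs could be captured alongside the request. It assumes the public ElevenLabs text-to-speech endpoint (`POST /v1/text-to-speech/{voice_id}` with an `xi-api-key` header and a JSON body containing `text`, `model_id` and `voice_settings`) as I understand it; please check the current ElevenLabs docs before relying on it. The helper names `build_tts_request` and `repro_report` are hypothetical and not part of speech-jokey, and the voice ID and settings values are placeholders.

```python
import json
import urllib.request

# Assumed base URL of the ElevenLabs text-to-speech endpoint.
API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


def build_tts_request(api_key, voice_id, text, model_id, stability, similarity_boost):
    """Build (but do not send) a TTS request with every input stated explicitly."""
    payload = {
        "text": text,
        "model_id": model_id,
        "voice_settings": {
            "stability": stability,
            "similarity_boost": similarity_boost,
        },
    }
    request = urllib.request.Request(
        f"{API_BASE}/{voice_id}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )
    return request, payload


def repro_report(voice_id, payload):
    """Return the inputs as JSON, without the API key, for pasting into an issue."""
    return json.dumps({"voice_id": voice_id, **payload}, indent=2, sort_keys=True)


if __name__ == "__main__":
    # Placeholder values; replace them with the ones that produced the odd result.
    request, payload = build_tts_request(
        api_key="YOUR_API_KEY",
        voice_id="YOUR_VOICE_ID",
        text="Speak this in a brutal and very manly way",
        model_id="eleven_multilingual_v2",
        stability=0.5,
        similarity_boost=0.75,
    )
    print(repro_report("YOUR_VOICE_ID", payload))
```

Pasting the output of `repro_report` into the issue, together with the model and settings selected on the website, would show whether the two requests actually used the same inputs.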

Integration with ElevenlabsAPI: Voices sound different over API than on website #5