n4ze3m / dialoqbase

Create chatbots with ease
https://dialoqbase.n4ze3m.com/
MIT License
1.6k stars 264 forks source link

Integrate transcription and TTS (Voice to Voice) #82

Closed johnnybloem closed 12 months ago

johnnybloem commented 1 year ago

Please consider adding a version with whisper_jax for speech transcription. Add elevenlabs functionality for tts. You would then make this UI accessible for disabled people to converse with chat gpt voice to voice. Users can select voice or text interaction in the UI. This way your app will be more inclusive and accessible to everyone regardless of physical ability, Keep up the good work! Thank you for creating this excellent UI.

n4ze3m commented 1 year ago

Hey, this is a great feature request and idea. I will try to add this feature in an upcoming release.

johnnybloem commented 1 year ago

I am grateful thank you!

n4ze3m commented 1 year ago

Hey, a beta version of speech-to-text and text-to-speech has been added to the playground in v0.0.30 release

johnnybloem commented 1 year ago

You are awesome! Thank you so much!!

n4ze3m commented 1 year ago

for more details

https://github.com/n4ze3m/dialoqbase/releases/tag/v0.0.30

or

https://twitter.com/n4ze3m/status/1705651895832830005

johnnybloem commented 1 year ago

Excellent. Can I set my ElevenLabs voice id and settings like stability, speaker boost = True, style = 0.4 etc.

n4ze3m commented 1 year ago

Currently there is limited customization available for Elevenlabs, and I fetch Elevenlabs voices from this API endpoint: https://api.elevenlabs.io/docs#/voices/Get_voices_v1_voices_get. ~I don't think it includes custom voices (I need to create one to check haha)~. I will add more customization features in the coming updates

n4ze3m commented 1 year ago

I will reply here when it is released. :)

n4ze3m commented 1 year ago

Hey, custom voice ID works

image

johnnybloem commented 1 year ago

Thank you so much for checking!!! That rocks. Best UI on github!