SverreNystad / gpt-dungeon-master

Welcome to the GPT Dungeon Master repository! This project harnesses the power of GPT models to create a dynamic and responsive Dungeon Master (DM) for tabletop role-playing games (RPGs). Whether you're a seasoned player looking for a quick rule reference or a group in need of an AI-driven DM for your next adventure, the GPT Dungeon Master is here

Integration of Text-to-Speech (TTS) Feature for Enhanced Immersion #2

Open SverreNystad opened 10 months ago

SverreNystad commented 10 months ago

While the GPT Dungeon Master provides textual responses, having an auditory component can significantly enhance the immersion for players, especially during intense or dramatic moments in the game.

I propose the integration of a Text-to-Speech (TTS) feature that can convert the GPT DM's textual responses into spoken words. This would allow players to hear the DM's narration, making the gameplay experience more engaging and lifelike.

Proposed Features:

SverreNystad commented 10 months ago

OpenAI's guide on speech to text: https://platform.openai.com/docs/guides/speech-to-text

Watson's TTS: https://www.youtube.com/watch?v=8k8S5ruFAUs

SverreNystad commented 10 months ago

I started out using Watson, but LOVO seems better because it lets us change the speed and emotion of the voice. We could use classifiers, or the LLM with function calling, to pick the right emotion and speed for each line of narration. API docs: https://api-doc.lovo.ai/
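A rough sketch of the function-calling idea, in case it helps the discussion. It defines a tool in the OpenAI function-calling format that asks the model for an emotion and a speed, then validates whatever comes back before it would be handed to a TTS provider. The tool name `set_voice_style`, the emotion labels, and the 0.5 to 2.0 speed range are all placeholders I made up for illustration; they are not taken from the LOVO or Watson APIs, so the real values would have to come from the provider's docs.

```python
import json
from dataclasses import dataclass

# Hypothetical emotion labels and speed range (assumptions, not from any TTS API).
ALLOWED_EMOTIONS = ("neutral", "excited", "fearful", "angry", "sad")
MIN_SPEED, MAX_SPEED = 0.5, 2.0

# Tool definition in the OpenAI function-calling format.
# The name "set_voice_style" is a placeholder chosen for this sketch.
SET_VOICE_STYLE_TOOL = {
    "type": "function",
    "function": {
        "name": "set_voice_style",
        "description": "Pick the emotion and speaking speed for the next line of narration.",
        "parameters": {
            "type": "object",
            "properties": {
                "emotion": {"type": "string", "enum": list(ALLOWED_EMOTIONS)},
                "speed": {"type": "number", "minimum": MIN_SPEED, "maximum": MAX_SPEED},
            },
            "required": ["emotion", "speed"],
        },
    },
}


@dataclass(frozen=True)
class VoiceStyle:
    emotion: str
    speed: float


def parse_voice_style(arguments_json: str) -> VoiceStyle:
    """Turn the model's tool-call arguments (a JSON string) into safe settings."""
    try:
        args = json.loads(arguments_json)
    except json.JSONDecodeError:
        args = {}
    if not isinstance(args, dict):
        args = {}

    # Fall back to a neutral voice if the model returns an unknown emotion.
    emotion = args.get("emotion")
    if emotion not in ALLOWED_EMOTIONS:
        emotion = "neutral"

    # Fall back to normal speed on bad input, then clamp to the allowed range.
    try:
        speed = float(args.get("speed", 1.0))
    except (TypeError, ValueError):
        speed = 1.0
    speed = max(MIN_SPEED, min(MAX_SPEED, speed))

    return VoiceStyle(emotion=emotion, speed=speed)
```

The tool would go in the `tools` list of the chat request, and the JSON string from the returned tool call's arguments would be passed to `parse_voice_style`. Validating on our side means a malformed or out-of-range answer from the model degrades to a neutral voice at normal speed instead of failing the TTS request.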

SverreNystad commented 10 months ago

We need to discuss which text-to-speech service is the best fit.