enricoros / big-AGI

Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
https://big-agi.com
MIT License
4.55k stars 1.04k forks source link

[Roadmap] Change Microphone Input #501

Closed liquidsnakeblue closed 1 month ago

liquidsnakeblue commented 1 month ago

Why Users will be able to change the input microphone

Description Just need to be able to select the input source for the microphone. I have many different inputs and even though I set the default browser input correctly, the call feature still does not detect any input from my voice.

Requirements If you can, Please break-down the changes use cases, UX, technology, architecture, etc.

enricoros commented 1 month ago

Unfortunately the Web Speech API which is what we use for the speech to text, does not support changing the input.

I've looked into the spec of the API and open bags and I couldn't find any information nor any technical way of doing this.

To enable switching of the microphone we will need to have a full in house media Pipeline and using stt voice transcription services. That functionality may come but it's way broader than just changing the input.