lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / DeepSeek),Knowledge Base(file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
https://chat-preview.lobehub.com
Other
37.4k stars 8.83k forks source link

[Request] Voice Chat Mode #1412

Open thiner opened 5 months ago

thiner commented 5 months ago

🥰 Feature Description

ChatGPT-like voice chat mode.

🧐 Proposed Solution

LobeChat has already had the ability of TTS and STT, can we move one step forward, enable the voice chat mode by send user speech to server and convert server response to voice automatically?

📝 Additional Information

No response

lobehubbot commented 5 months ago

👀 @thiner

Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible. Please make sure you have given us as much context as possible.\ 非常感谢您提交 issue。我们会尽快调查此事,并尽快回复您。 请确保您已经提供了尽可能多的背景信息。

arvinxx commented 5 months ago

Yeah, we have a plan ! please track it here: #480

kernkraft235 commented 4 months ago

@arvinxx please include the option to just have the assistant speak their response automatically (without explicitly being in a 2-way voice chat)

I was excited to see an app like this offer the TTS option, but then I saw it has to be manually selected from a nested menu each time.

My desired use case is to just write my question, hit enter and then get back to what I was doing with the LLM's response automatically playing with OAI TTS, I have to use python and janky iOS shortcuts to even get close to this working as well as I'd like

Every single app (native or 3rd party) seems to not see a reason to have a user type their part, but the LLM to respond in voice.