pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI
BSD 2-Clause "Simplified" License
3.17k stars 280 forks source link

Interrupted by itself when speaker on #188

Open ramishi opened 4 months ago

ramishi commented 4 months ago

using the Simple-chatbot example and when I turn on speaker, the bot heard itself and interrupted. is there a way to avoid this?

chadbailey59 commented 4 months ago

That can be tough, because the bot talking sounds like a person. :) Were you using the Daily transport? Were you using Silero VAD, or the built-in WebRTC VAD?

jinqiupeter commented 4 months ago

One option could be Speaker Diarization, which first detects voice (human voice or speaker's voice), then compares the detected voice with the voice currently speaking, and then decides if interruption is needed.

n4cr commented 2 months ago

Would it be possible to disable VAD only until the LLM is finished talking? Disabling VAD through Daily Transport now fully disables getting a response from LLM.

@chadbailey59 I've been using the Daily transport and Silero VAD