rhasspy / rhasspy3

An open source voice assistant toolkit for many human languages
MIT License
295 stars 23 forks source link

Multi-Channel Audio Input Issues #56

Open vshtykhnov opened 9 months ago

vshtykhnov commented 9 months ago

Description

I've encountered an issue with Rhasspy 3 when using multi-channel audio input, specifically when more than one channel is used (e.g., arecord -c 4). In these cases, the wake word detection becomes unresponsive, and the application seems to hang during audio processing.

Steps to Reproduce

  1. Configure Rhasspy 3 to use an audio input source with multiple channels (e.g., using arecord with -c 4 for four channels).
  2. Attempt to trigger the wake word.

Expected Behavior

Rhasspy should process multi-channel audio inputs correctly, either by internally converting them to a single channel for processing or by handling multi-channel data without issues.

Actual Behavior

When multi-channel audio is used:

Questions and Concerns

Additional Context

The issue seems to revolve around how Rhasspy 3 interacts with multi-channel audio inputs and its impact on subsequent processing stages, particularly wake word detection.

Any insights or assistance on this matter would be highly valuable.

Thank you for your attention to this issue.