Nuvotion-Visuals / Harmony3

1 stars 0 forks source link

Implement Wake Word Detection with Voice Activity Detection (VAD) Integration #41

Open tom-leamon opened 6 months ago

tom-leamon commented 6 months ago

Overview

We are looking to enhance our audio processing application by integrating wake word detection into our existing live rolling window transcription system. The goal is to efficiently detect specified wake words ("Hey Siri"-like functionality) using client-side VAD to start and stop sending audio data to the server for processing.

Current Implementation

Proposed Implementation

Client-Side
Server-Side

Goals

Additional Considerations

Next Steps