Closed maximaminima closed 2 months ago
Tracking here test_stt_transcription_loop
with few tests to compare m4a and wav audio streams.
Directly reading audio buffer reduces the latency of processing by skipping file conversion.
Think about how to remove silences - next step.
@bsbarkur I added is_silent logic to this branch. Tested and latency is not affected much by adding is_slient. Please test it and revert the commit incase of any issues.
Closing this issue
Need this perhaps for streaming transcription as per groq docs. m4a is better for streaming apparently than wav.