Open antimatter84 opened 1 year ago
Hi @antimatter84,
I had the exact same experience, and what helped me greatly was experimenting with the chunk_size parameter. In my case, setting it to 512 instead of the default 1024 drastically improved the quality of the recorded audio, which also made recognition (with VOSK) much more reliable.
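In case it helps, here is a minimal sketch of what I mean. It assumes the SpeechRecognition `Microphone` class is what you record with; `record_once` and `chunk_duration_ms` are hypothetical helper names, and the 16000 Hz rate in the comments is only an illustrative assumption, not something taken from your setup.

```python
def chunk_duration_ms(chunk_size, sample_rate):
    """Length of audio covered by one chunk, in milliseconds."""
    return 1000.0 * chunk_size / sample_rate


def record_once(chunk_size=512, device_index=None):
    """Capture one phrase using a smaller-than-default chunk size.

    Hypothetical helper: needs the third-party SpeechRecognition and
    PyAudio packages plus a working microphone, so the import is kept
    local to this function.
    """
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    # chunk_size defaults to 1024 frames; 512 halves the buffer length.
    with sr.Microphone(device_index=device_index, chunk_size=chunk_size) as source:
        return recognizer.listen(source)


# At an assumed 16000 Hz, 1024 frames cover 64 ms and 512 frames cover 32 ms.
default_ms = chunk_duration_ms(1024, 16000)
smaller_ms = chunk_duration_ms(512, 16000)
```

A smaller chunk means each read from the device covers less audio, which may or may not be why it helped on my machine; I have not verified the underlying cause.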
Give it a shot and let me know how it goes :)
Steps to reproduce
wave module
Here's some example code that shows what I do (copied together from the actual source):
Expected behaviour
The written wave file should sound like the original audio source: clean and correct tempo
Actual behaviour
The written wave file sounds somewhat choppy and plays way too fast. Attached: audiotest.wav.zip
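One possible cause of "too fast" playback, not confirmed for this report, is a WAV header whose sample rate is higher than the rate the audio was actually captured at. The sketch below uses only the standard-library `wave` module to show that effect on synthetic silence; `write_wav`, `wav_duration_seconds`, and `demo_header_mismatch` are hypothetical helper names, and the 16000 Hz capture rate is an assumption for illustration.

```python
import os
import tempfile
import wave


def write_wav(path, raw_pcm, sample_rate, sample_width, channels=1):
    """Write raw PCM bytes to a WAV file with an explicit header."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)  # bytes per sample
        wav_file.setframerate(sample_rate)   # should match the capture rate
        wav_file.writeframes(raw_pcm)


def wav_duration_seconds(path):
    """Return the playback duration implied by the WAV header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def demo_header_mismatch():
    """Return (correct, mismatched) durations for one second of silence."""
    capture_rate = 16000                      # assumed capture rate
    one_second = b"\x00\x00" * capture_rate   # 16-bit mono silence
    with tempfile.TemporaryDirectory() as tmp:
        good = os.path.join(tmp, "good.wav")
        bad = os.path.join(tmp, "bad.wav")
        write_wav(good, one_second, capture_rate, 2)
        # Same samples, but the header claims twice the real rate.
        write_wav(bad, one_second, capture_rate * 2, 2)
        return wav_duration_seconds(good), wav_duration_seconds(bad)
```

With SpeechRecognition, the `AudioData` object exposes `sample_rate`, `sample_width`, and `get_raw_data()`, so passing those values straight to the `wave` writer is one way to rule this mismatch out.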
Recording audio from the device with
arecord -D plughw:1,0 -f cd -d 5 alsatest.wav
produces a clean result.
System information
My system is Linux Mint 20.3 Cinnamon.
My Python version is 3.8.10.
My Pip version is 20.0.2.
My SpeechRecognition library version is 3.9.0.
My PyAudio library version is 0.2.13.
My microphones are:
My working microphones are: