alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Apache License 2.0
7.37k stars 1.04k forks source link

nodejs with microphone detects something when no one is speaking #1488

Open jrichardsz opened 6 months ago

jrichardsz commented 6 months ago

Just after start this sample: https://github.com/alphacep/vosk-api/blob/master/nodejs/demo/test_microphone.js

The log is

LOG (VoskAPI:ReadDataFiles():model.cc:213) Decoding params beam=13 max-active=7000 lattice-beam=6
LOG (VoskAPI:ReadDataFiles():model.cc:216) Silence phones 1:2:3:4:5:6:7:8:9:10
LOG (VoskAPI:RemoveOrphanNodes():nnet-nnet.cc:948) Removed 0 orphan nodes.
LOG (VoskAPI:RemoveOrphanComponents():nnet-nnet.cc:847) Removing 0 orphan components.
LOG (VoskAPI:ReadDataFiles():model.cc:248) Loading i-vector extractor from /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/ivector/final.ie
LOG (VoskAPI:ComputeDerivedVars():ivector-extractor.cc:183) Computing derived variables for iVector extractor
LOG (VoskAPI:ComputeDerivedVars():ivector-extractor.cc:204) Done.
LOG (VoskAPI:ReadDataFiles():model.cc:279) Loading HCLG from /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/graph/HCLG.fst
LOG (VoskAPI:ReadDataFiles():model.cc:294) Loading words from /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/graph/words.txt
LOG (VoskAPI:ReadDataFiles():model.cc:303) Loading winfo /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/graph/phones/word_boundary.int
LOG (VoskAPI:ReadDataFiles():model.cc:310) Loading subtract G.fst model from /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/rescore/G.fst
LOG (VoskAPI:ReadDataFiles():model.cc:312) Loading CARPA model from /home/computer/Github/speech_to_text_sandbox/python/offline_vosk/python_offline_vosk/vosk-model-it-0.22/rescore/G.carpa
<Buffer@0x1262d770 52 49 46 46 24 00 00 80 57 41 56 45 66 6d 74 20 10 00 00 00 01 00 01 00 80 3e 00 00 00 7d 00 00 02 00 10 00 64 61 74 61 00 00 00 80>
{ partial: '' }
<Buffer@0x12631770 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ... 3950 more bytes>
{ partial: '' }
<Buffer@0x12632720 01 00 01 00 01 00 02 00 02 00 02 00 01 00 00 00 ff ff 00 00 00 00 00 00 01 00 02 00 01 00 02 00 02 00 00 00 02 00 02 00 01 00 02 00 01 00 01 00 02 00 ... 3950 more bytes>

The Buffer line is showing each second (or less). I'm not speaking. I'm in a empty room.