Open dslugPX opened 1 year ago
In your other issue #111 I referenced us exposing more configuration options. We've already added two:
1) VAD timeout. Once wake is activated and you start speaking, this is the amount of silence (in milliseconds) before VAD assumes you're done speaking and transcribes the captured audio. The default is 300ms but you can try extending this. Do note there is a trade-off - because it will wait longer for you to potentially finish your command the perceived latency of the command will increase by however many ms you add.
2) Maximum stream duration. This is a value (measured in seconds) that will trigger a final timeout after wake. With this value set it will only capture audio after wake for max $NUM seconds before cutting off VAD and sending the audio for processing. It's generally a good idea to have to ensure the mic doesn't get stuck open but should also help with the endless/very long VAD issue you are having. We assume the default value of five seconds should be long enough for someone to "spit out" a given command but we could certainly be wrong about that. However, like the VAD timeout you may need to kind of "dial-in" this value to ensure it's long enough for your commands and your rate of speech while also not being annoying long in the occasional instances when your background noise confuses VAD. One or more of the configuration options I mentioned previously may also help with the fundamental "background noise VAD confusion" issue.
I would suggest you check on #111 and this issue later in the day once we complete these changes so you can test with them.
In terms of the crash, that could be any number of things and we'll try to reproduce it. Essentially what is happening here is because Willow is intended to be a "no touch, no prompt" device if it ends up in some state of unknown confusion it will just reset itself to make sure it's good to try again for another command. It's one of the many "back stops" we have to ensure that even in it's early form Willow attempts to provide the best overall experience possible. We don't ever want to have to tell people to do things like "unplug it and plug it back in" - no matter how early we are.
Thanks again for reporting these!
This may better be described as command completion timing issues - not sure.
From time to time Hi ESP will activate after the wake word and one of four possible outcomes can happen.
Note: Probably some of this is related to the background noise in our house - one more issue coming in on that next.