argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon
https://takeargmax.com/blog/whisperkit
MIT License
3.17k stars 267 forks source link

Fix early stopping for VAD #155

Closed ZachNagengast closed 4 months ago

ZachNagengast commented 4 months ago

Addresses #154 #150 #152

Because shouldEarlyStop is a class variable now, we need to keep track of each window the VAD chunker is actively decoding with UUIDs. This way we can stop one of them and allow the rest to continue.