Closed laochen closed 2 months ago
Happy man has something going on recently, so he might reply to you late.
It is best to keep all the original data in the flush action to facilitate splicing with the remaining buffer for the final recognition. The current code will cause the accuracy of the last sentence recognition. My modification suggestions voice-activity-detector.cc line:126 //int32t end = buffer.Tail() - model_->MinSilenceDurationSamples(); int32t end = buffer.Tail();
Yes, I agree with you.
Would you mind creating a pull request to fix it?
It is best to keep all the original data in the flush action to facilitate splicing with the remaining buffer for the final recognition. The current code will cause the accuracy of the last sentence recognition. My modification suggestions voice-activity-detector.cc line:126 //int32t end = buffer.Tail() - model_->MinSilenceDurationSamples(); int32t end = buffer.Tail();