sebastianruder / NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
https://nlpprogress.com/
MIT License
22.43k stars, 3.61k forks

Voice Activity Detection #268

Open · unhammer opened 5 years ago

unhammer commented 5 years ago

I didn't see anything on voice activity detection (VAD), so maybe it should be a new category? I don't know enough about it to say whether it can be considered a language-independent task, or what the current state of the art is (which is why I'm opening this issue ;-))

webrtc-vad does seem to be used a lot, so it might be the de facto baseline, while https://ieeexplore.ieee.org/document/8309294 / https://github.com/jtkim-kaist/VAD seems like a contender for the state of the art (and has a freely available dataset).

sebastianruder commented 5 years ago

Thanks for the mention. We could potentially add this to a speech-related section if there's interest.