Add VAD support - Githubissues

sensein / b2aiprep

Apache License 2.0

7 stars 6 forks source link

Add VAD support #10

Open satra opened 8 months ago

satra commented 8 months ago

files could have a lot of silence before or after. Add VAD functionality and add it optionally to the b2ai feature pipeline.

GasserElbanna commented 8 months ago

Some options to consider:

https://huggingface.co/pyannote/brouhaha
https://huggingface.co/pyannote/segmentation
https://huggingface.co/speechbrain/vad-crdnn-libriparty

fabiocat93 commented 8 months ago

A simpler implementation for removing silence from start and end of audio files is here:

https://pytorch.org/audio/main/generated/torchaudio.transforms.Vad.html

Also, a note on pyannote-audio: since the last version, it's recommended to use the speaker-diarization pipeline and not the segmentation model because it works better in terms of diarization error rate