Open satra opened 8 months ago
A simpler implementation for removing silence from start and end of audio files is here:
Also, a note on pyannote-audio: since the last version, it's recommended to use the speaker-diarization pipeline and not the segmentation model because it works better in terms of diarization error rate
files could have a lot of silence before or after. Add VAD functionality and add it optionally to the b2ai feature pipeline.