The preprocess_wav function will trim out the silences in the audio file, and return a new wav, but it did not return information about which segments were cut. If we are dealing with a video, the audio timeline is different from the video frame timeline. Therefore, we cannot process the video frame according to the diarization. Can the preprocess_wav function also return some information, and then we can restore it to the original timeline?
Hello I face the same problem as I need to have the timestamps of the speakers from the original wav file: have you progressed ln this issue? Thank you
The preprocess_wav function will trim out the silences in the audio file, and return a new wav, but it did not return information about which segments were cut. If we are dealing with a video, the audio timeline is different from the video frame timeline. Therefore, we cannot process the video frame according to the diarization. Can the preprocess_wav function also return some information, and then we can restore it to the original timeline?