Open BBC-Esq opened 8 months ago
Here's the final version that I ended up incorporating into my latest release, to avoid the issue, but would still be very interested in knowing if there's a way to address a single file to cause the entire batch processing of multiple files to fail...
https://github.com/BBC-Esq/WhisperS2T-transcriber/releases/tag/v1.1.0
Hey @BBC-Esq ! I think there can be a simple fix for this. I will add the fix in next release.
PS: I'm slightly stuffed with my office work. Expect some delay in the next release (end of march probably).
PPS: Next release will also include end-to-end deployment ready server for WhisperS2T !!
Hey @BBC-Esq ! I think there can be a simple fix for this. I will add the fix in next release.
PS: I'm slightly stuffed with my office work. Expect some delay in the next release (end of march probably).
PPS: Next release will also include end-to-end deployment ready server for WhisperS2T !!
Do you have time to continue to work on this repository? Ctranslate2 just implemented flash attention BTW.
When my script batch processes a bunch of audio files using the approach you gave me to use a list of files and their settings when processing, if a single file fails for any reason, it prevents the transcriptions of all files' transcriptions from being done? I created a workaround to process each file to the
transcribe_with_vad
method (each using its own tqdm) and added error handling, which works. I was wondering if there's a way to make it so I can use your most efficient approach and still have error handling for a specific audio file? Here is the original script and a comparison with the single audio file processing with error handling: