m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
BSD 2-Clause "Simplified" License

Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX? #861

Open imc-db opened 3 months ago

imc-db commented 3 months ago

Is there a way to transcribe multiple audio files asynchronously/parallel with whisperX?

3manifold commented 1 month ago

That's an interesting task. I could start working on it if the repository authors consider it a high-priority feature. What are the chances that a pull request for this feature would be merged?

jonwomack commented 1 month ago

Temporary workaround I've been using:

ls /path/to/your/folder/* | xargs -n 1 -P <num_parallel> bash -c 'whisperx "$0" --language en'
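
A variant of this workaround that tolerates spaces in filenames might look like the sketch below. It is not part of whisperX: `transcribe_parallel` is a hypothetical helper name, and it assumes a `find` and `xargs` that support `-print0`, `-0`, and `-P` (GNU and BSD versions do). The command to run is passed as arguments, and each file path is appended as the last argument.

```shell
#!/usr/bin/env bash

# Hypothetical helper (not part of whisperX): run a command once per regular
# file in a directory, with at most <jobs> processes running at a time.
# Usage: transcribe_parallel <dir> <jobs> <command> [args...]
transcribe_parallel() {
  local dir="$1" jobs="$2"
  shift 2
  # NUL-delimited paths survive spaces and newlines in filenames.
  # xargs appends one path to the end of the given command (-n 1)
  # and keeps up to $jobs commands running in parallel (-P).
  find "$dir" -maxdepth 1 -type f -print0 | xargs -0 -n 1 -P "$jobs" "$@"
}

# Example call, assuming the whisperx CLI is on PATH and accepts the audio
# path after its options:
# transcribe_parallel /path/to/your/folder 4 whisperx --language en
```

Each parallel process loads its own copy of the model, so the number of jobs is limited by available GPU or CPU memory as much as by core count.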