Closed: Blair-Johnson closed this PR 1 year ago
Initial benchmarking indicates that batching enables significantly sub-linear scaling at least up to batch_size=16 on an NVIDIA A100 80GB. Scaling remains sub-linear relative to the batch_size=1 case beyond that point, but the time required for a set of batched audio clips begins to grow linearly with further increases in batch size as the GPU becomes saturated. In this figure, a 214-minute podcast was batched together with itself at batch sizes in {1, 2, 4, 8, 16} and transcribed in parallel. The linear reference assumes linear scaling with respect to the batch_size=1 case and is analogous to running consecutive clips serially.
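To make the linear reference concrete, here is a minimal sketch of how the comparison can be computed. The timing numbers below are illustrative placeholders, not measurements from the A100 run:

```python
def linear_reference(t_batch1: float, batch_size: int) -> float:
    """Projected wall-clock time if batching gave no benefit:
    running `batch_size` copies of the clip serially at the
    batch_size=1 speed."""
    return t_batch1 * batch_size

def speedup_vs_serial(t_batch1: float, t_batched: float, batch_size: int) -> float:
    """How many times faster the batched run is than the serial
    (linear-reference) baseline; > 1 means sub-linear scaling."""
    return linear_reference(t_batch1, batch_size) / t_batched

# Illustrative placeholder timings (seconds) for transcribing the
# whole batch at each batch size -- NOT real A100 measurements.
measured = {1: 100.0, 2: 120.0, 4: 170.0, 8: 300.0, 16: 560.0}

for n, t in measured.items():
    print(f"batch_size={n:2d}: speedup vs serial = "
          f"{speedup_vs_serial(measured[1], t, n):.2f}x")
```

Any speedup above 1.0x corresponds to a point below the linear reference in the figure.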
Opening PR for merging and conflict resolution. The `batch-processing` branch introduces a single major modification to the behavior of the `model.transcribe()` method, which can now accept a list of audio file paths rather than a single audio file path. These files are packed into the batch dimension of the model for transcription, allowing users to achieve better GPU utilization. Audio clips can be of different lengths, and the internal batch size is reduced as the transcription of shorter files completes.

Remaining issues to address: