Closed Veldhoen closed 3 months ago
Proposal: Use the PR from faster-whisper that introduces batching + add the piece of code from WhisperX that does diarization
Reasons why:
I wrote a document that details the decision: https://docs.google.com/document/d/17rVrum6L8qlsewoR36ricKwTtQMVWrayg8xfDkmPMRY/edit?usp=sharing
Overview of computational performance (time & memory - also for longform) ASR performance (WER) Project quality (well maintained software, no weird depencencies etc)
Presumably Faster Whisper or WhisperX