beeldengeluid / whisper-asr-worker

MIT License

Decide on Whisper implementation #41

Closed: Veldhoen closed this issue 3 months ago

Veldhoen commented 4 months ago

- Overview of computational performance (time & memory, also for long-form audio)
- ASR performance (WER)
- Project quality (well-maintained software, no weird dependencies, etc.)
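For reference, the WER criterion mentioned above is usually computed as the word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words. The following is a minimal sketch under that standard definition; the function name `word_error_rate` and the whitespace tokenisation are assumptions for illustration, not something taken from this repository, and a real evaluation would normally also normalise casing and punctuation first.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference length.

    Illustrative helper (hypothetical name); tokenises on whitespace only.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Note that WER can exceed 1.0 when the hypothesis contains many inserted words, so it is not strictly a percentage of "wrong" words.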

The choice will presumably be between Faster Whisper and WhisperX.
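A rough sketch of how the time and memory comparison between candidates could be set up is shown below. Everything here is an assumption for illustration: `benchmark`, `BenchmarkResult` and `fake_transcribe` are hypothetical names, and each real implementation would have to be wrapped in a function that takes an audio path and returns text. Note also that `tracemalloc` only tracks memory allocated by Python itself, so it will not capture GPU memory or most native allocations made by the model runtime; those need a separate tool.

```python
import time
import tracemalloc
from typing import Callable, NamedTuple


class BenchmarkResult(NamedTuple):
    seconds: float          # wall-clock time of the transcribe call
    peak_python_bytes: int  # peak Python-level allocations (not GPU memory)
    output: str             # transcript returned by the callable


def benchmark(transcribe: Callable[[str], str], audio_path: str) -> BenchmarkResult:
    """Run one transcription call and record wall time and peak Python memory."""
    tracemalloc.start()
    start = time.perf_counter()
    try:
        output = transcribe(audio_path)
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
    finally:
        # Always stop tracing, even if the transcription raises.
        tracemalloc.stop()
    return BenchmarkResult(elapsed, peak, output)


def fake_transcribe(audio_path: str) -> str:
    """Stand-in for a real model wrapper (hypothetical); allocates some memory."""
    buffer = [0] * 100_000
    return f"transcript of {audio_path} ({len(buffer)} samples)"
```

Calling `benchmark(fake_transcribe, "example.wav")` returns the three measurements for the stand-in; swapping in a wrapper around each candidate would give comparable numbers on the same audio files.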

greenw0lf commented 4 months ago

Proposal: use the faster-whisper PR that introduces batching, and add the piece of WhisperX code that does diarization.
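To make the "transcription + diarization" combination concrete: one common way to merge the two outputs is to give each transcript segment the speaker whose turn overlaps it the most in time. The sketch below illustrates that idea only. It is not the WhisperX code referred to above, and the `Segment`, `SpeakerTurn` and `assign_speakers` names are hypothetical; the input data structures of the real libraries differ.

```python
from typing import List, NamedTuple, Optional, Tuple


class Segment(NamedTuple):
    start: float  # seconds
    end: float    # seconds
    text: str


class SpeakerTurn(NamedTuple):
    start: float  # seconds
    end: float    # seconds
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment], turns: List[SpeakerTurn]
) -> List[Tuple[Optional[str], Segment]]:
    """Label each transcript segment with the speaker that overlaps it most.

    Segments that overlap no speaker turn are labelled None.
    """
    labelled = []
    for seg in segments:
        best_speaker: Optional[str] = None
        best_overlap = 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labelled.append((best_speaker, seg))
    return labelled
```

This nested loop is quadratic in the number of segments and turns, which is fine for a sketch but may need an interval index for very long recordings.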

Reasons why:

greenw0lf commented 3 months ago

I wrote a document that details the decision: https://docs.google.com/document/d/17rVrum6L8qlsewoR36ricKwTtQMVWrayg8xfDkmPMRY/edit?usp=sharing