rmusser01 / tldw

Too Long, Didn't Watch(TL/DW): Your Personal Research Multi-Tool - Open Source NotebookLM
Apache License 2.0
44 stars 2 forks source link

Enhancement: Audio processing Pipeline #36

Open rmusser01 opened 1 month ago

rmusser01 commented 1 month ago

Find out what the expected error rate is, find out what's possible.

https://github.com/kadirnar/whisper-plus

rmusser01 commented 1 month ago

https://github.com/MahmoudAshraf97/whisper-diarization/ https://github.com/transcriptionstream/transcriptionstream https://github.com/SYSTRAN/faster-whisper https://whisperapi.com/word-error-rate-wer https://arxiv.org/abs/2311.00430 https://github.com/PyAV-Org/PyAV[ https://github.com/snakers4/silero-vad https://github.com/m-bain/whisperX https://amgadhasan.substack.com/p/sota-asr-tooling-long-form-transcription

rmusser01 commented 3 weeks ago

https://www.futurebeeai.com/blog/breaking-down-word-error-rate