showlab / VLog

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
MIT License
528 stars, 26 forks

whisperX has better alignment #2

Closed: tensorboy closed this issue 1 year ago

tensorboy commented 1 year ago

https://github.com/m-bain/whisperX

QinghongLin commented 1 year ago

Thanks for sharing this. We have recently been investigating several audio toolboxes, and whisperX is a powerful one!