r4victor / syncabook

📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
MIT License
274 stars 27 forks source link

Utilize Whisper AI for more accuracy? #27

Open ndevries84 opened 1 year ago

ndevries84 commented 1 year ago

Have you looked at utilizing whisper AI to improve accuracy and speed https://github.com/m-bain/whisperX/

Nate

r4victor commented 1 year ago

@ndevries84, thanks for the link! syncabook is based on afaligner, which works by aligning audio. I haven't considered ASR at the time because it was more compute-intensive. Probably, the next syncabook version should be based on ASR.