natrys / whisper.el

Speech-to-Text interface for Emacs using OpenAI's whisper model and whisper.cpp as inference engine.

Examining the transcription #25

Closed (agzam closed this 1 month ago)

agzam commented 3 months ago

Very cool package. I haven't installed or tried it yet, but I will shortly. I immediately see an important, personal use case for it.

After transcribing an audio file, it would be nice to present the transcription in a buffer where the timestamps are overlays rather than part of the text. It would also be nice to jump from the text to the exact point in the audio, similar to how elfeed-tube does with mpv, or even better, to jump to that point in a karaoke-style video file.

This isn't a request, merely an idea. If I find the time to fool around with it, I'll concoct a prototype.

agzam commented 1 month ago

Oh, wow. This already exists. One just needs to transcribe the audio into .srt/.vtt format and then open the result using https://github.com/sachac/subed. Sacha Chua is awesome.
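
For anyone curious what the .srt step amounts to, here is a minimal sketch in Python. It assumes the transcription is already available as a list of (start_seconds, end_seconds, text) tuples; that input shape and the function names `format_timestamp` and `segments_to_srt` are hypothetical, chosen for illustration, and are not part of whisper.el or subed.

```python
def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from (start, end, text) tuples (hypothetical input shape)."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        # Each cue: sequence number, time range, then the caption text.
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [(0.0, 2.5, " Hello "), (2.5, 4.0, "world")]
    print(segments_to_srt(demo))
```

Saving the resulting string to a file with an .srt extension should give something a subtitle editor can open, with the timestamps kept separate from the caption text.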