collabora / WhisperLive

A nearly-live implementation of OpenAI's Whisper.
MIT License
2.09k stars 283 forks source link

Timestamping transcriptions? #234

Open stuartpb opened 5 months ago

stuartpb commented 5 months ago

Would it be possible to add some option to delimit the transcribed output as timestamp-prefixed lines, or some other mark/metadata when each word occurs in the source media?

This is the way I was thinking I could hack it, if there isn't any way to surface this from the lower-level implementation:

stuartpb commented 5 months ago

I see now that #211 links to a fork with word-level timestamps: it looks like someone still needs to submit a pull request?