How can we integrate result[segments] from the original whisper transcribe to get timestamps associated with the transcription?

BriansIDP / WhisperBiasing

MIT License

63 stars 2 forks source link

How can we integrate result[segments] from the original whisper transcribe to get timestamps associated with the transcription? #9

Open vishakha-lall opened 6 months ago

vishakha-lall commented 6 months ago

Since we aren't using the transcribe.py module from Whisper, I was wondering if we have explored obtaining the timestamps for the transcription with the contextual biasing? For instance if I am looking to produce the following transcription output "<|startoftranscript|><|ja|><|translate|><|0.00|> He has grave doubts whether Sir Frederick Layton's work is really Greek after all and<|6.24|><|6.24|> can discover in it but little of rocky Ithaca.<|9.44|><|endoftext|>"

BriansIDP commented 6 months ago

Hi. No, we haven't explored this. That's an interesting thing to explore and I'd be very interested in seeing how it works with time stamps.