bnosac / audio.whisper

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
Other
118 stars 13 forks source link

Multiple offsets #58

Closed jwijffels closed 8 months ago

jwijffels commented 8 months ago

Multiple offsets (e.g. audio where non-silence data is identified). Note that whisper.cpp does seem to return the whole segment which the offset+duration chunks belongs to and minimum duration looks to be at least 1.1 secs