Closed: jwijffels closed this 8 months ago
Multiple offsets (e.g. audio where non-silence data is identified). Note that whisper.cpp appears to return the whole segment to which each offset+duration chunk belongs, and the minimum duration appears to be at least 1.1 seconds.
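As a rough illustration of the idea, the sketch below turns a list of detected non-silence intervals into parallel offset and duration lists in milliseconds. It does not call whisper.cpp or any wrapper. The function name `to_offsets_and_durations`, the millisecond interval format, and the decision to pad short chunks up to 1.1 seconds are all assumptions made for this example; the 1.1 second figure is only the observation noted above, not a documented limit.

```python
from typing import List, Tuple

# Assumption: taken from the "at least 1.1 secs" observation, not a documented limit.
MIN_DURATION_MS = 1100


def to_offsets_and_durations(
    voiced: List[Tuple[int, int]],
    audio_length_ms: int,
    min_duration_ms: int = MIN_DURATION_MS,
) -> Tuple[List[int], List[int]]:
    """Convert (start_ms, end_ms) non-silence intervals into offsets and durations.

    Hypothetical helper: chunks shorter than min_duration_ms are widened,
    first by extending the end, then by moving the start back if the
    audio ends too soon.
    """
    offsets: List[int] = []
    durations: List[int] = []
    for start, end in voiced:
        # Clamp the interval to the bounds of the audio.
        start = max(0, start)
        end = min(end, audio_length_ms)
        if end <= start:
            continue  # skip empty or inverted intervals
        if end - start < min_duration_ms:
            end = min(start + min_duration_ms, audio_length_ms)
            start = max(0, end - min_duration_ms)
        offsets.append(start)
        durations.append(end - start)
    return offsets, durations


if __name__ == "__main__":
    voiced = [(0, 500), (2000, 5000), (9500, 10000)]
    offsets, durations = to_offsets_and_durations(voiced, audio_length_ms=10000)
    print(offsets, durations)
```

Since whisper.cpp seems to return the whole segment a chunk falls into, the transcribed text may cover more audio than the requested offset and duration, so results for neighbouring chunks can overlap.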