Open eschmidbauer opened 1 month ago
Hi @eschmidbauer ,
It should be possible, but seems like we'll need to make some modifications to the transcribe
function:
Currently, it runs on a single 30s window.
John
It would be great to demonstrate long-form here perhaps by using sliding window
Is long form inference possible with
whisper_trt
? I tried inference on 4m16s audio clip and it appeared to only transcribe 30s, here is my script: