Open shirayu opened 2 years ago
Might there be any progress on this? Also, how could this be made to work with more than 30 seconds and with arbitrary splits? This would be great for continuous recognition.
Hi, I haven't worked on this yet, while I added --frame
argument on v0.6.4.
Thank you for your interest.
Whisper assumes a 30-second interval as an input. So, whispering does not request analysis from whisper until 30 seconds have elapsed without
--allow-padding
. However, it is useful to show temporary transcriptions for short intervals.