argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon
https://takeargmax.com/blog/whisperkit
MIT License
3.17k stars 268 forks

Streaming Emulation for Files #26

Closed ZachNagengast closed 6 months ago

ZachNagengast commented 7 months ago

Needed for benchmarking the streaming functionality, as well as for generally testing its accuracy and performance. A simple loop can read a file in incremental n-second chunks, where the audio length increases by n seconds on each iteration, and the transcription is appended as the audio grows.
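
A rough sketch of the loop described above, written in Python only to illustrate the idea. It is not WhisperKit code: `growing_prefixes`, `emulate_streaming`, and the `transcribe` callback are hypothetical names, and the audio is assumed to be already decoded into a flat sequence of samples at a known sample rate.

```python
from typing import Callable, Iterator, List, Sequence


def growing_prefixes(
    samples: Sequence[float], sample_rate: int, chunk_seconds: float
) -> Iterator[Sequence[float]]:
    """Yield prefixes of `samples` that grow by `chunk_seconds` on each step."""
    step = int(sample_rate * chunk_seconds)
    if step <= 0:
        raise ValueError("chunk_seconds * sample_rate must cover at least 1 sample")
    end = 0
    while end < len(samples):
        # The last prefix is clamped to the file length, so it may grow
        # by less than a full chunk.
        end = min(end + step, len(samples))
        yield samples[:end]


def emulate_streaming(
    samples: Sequence[float],
    sample_rate: int,
    chunk_seconds: float,
    transcribe: Callable[[Sequence[float]], str],  # hypothetical transcriber
) -> List[str]:
    """Transcribe each growing prefix and append the result per iteration."""
    transcripts: List[str] = []
    for prefix in growing_prefixes(samples, sample_rate, chunk_seconds):
        transcripts.append(transcribe(prefix))
    return transcripts
```

Keeping one transcript per iteration, rather than only the final one, would make it possible to compare intermediate results against the full-file transcription when benchmarking. A stand-in callback such as `lambda audio: f"{len(audio)} samples"` is enough to check the chunking logic before wiring in a real model.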