argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon
http://argmaxinc.com/blog/whisperkit
MIT License
3.92k stars, 330 forks

WhisperKit init refactor #84

Closed jkrukowski closed 7 months ago

jkrukowski commented 8 months ago

Possible future improvements:

xiangliangX commented 7 months ago

Can the model now be used completely offline? It seems that loadTokenizer still needs to connect to Hugging Face (hf) to download the tokenizer.

jkrukowski commented 7 months ago

I'd like to avoid requiring WhisperMLModel for these, because we will soon be working with MLX models that follow the same TextDecoding protocols but won't have associated MLModels. Everything else looks good.

Not much is left in this PR once this is removed, so I'll close it for now.
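For context, the design concern in the review comment above can be sketched roughly as follows. This is only an illustration under stated assumptions: every type name here (TextDecodingSketch, CoreMLBackedSketch, CoreMLDecoderSketch, MLXDecoderSketch, PipelineSketch) is hypothetical and is not WhisperKit's actual API, and the "decoding" is placeholder arithmetic rather than model inference. The point is only that a pipeline which depends on the decoding protocol alone can accept a backend that has no MLModel.

```swift
import Foundation

// Hypothetical stand-ins for illustration; NOT WhisperKit's real declarations.

/// Backend-agnostic decoding contract (loosely inspired by the TextDecoding idea).
protocol TextDecodingSketch {
    func decodeNextToken(after tokens: [Int]) -> Int
}

/// Separate, optional capability for backends that wrap a Core ML model file.
protocol CoreMLBackedSketch {
    var modelURL: URL? { get }
}

/// A Core ML style decoder adopts both protocols.
struct CoreMLDecoderSketch: TextDecodingSketch, CoreMLBackedSketch {
    var modelURL: URL?

    func decodeNextToken(after tokens: [Int]) -> Int {
        // Placeholder logic: a real decoder would run model inference here.
        return (tokens.last ?? 0) + 1
    }
}

/// An MLX style decoder adopts only the decoding protocol; it has no MLModel.
struct MLXDecoderSketch: TextDecodingSketch {
    func decodeNextToken(after tokens: [Int]) -> Int {
        // Placeholder logic, deliberately different from the Core ML sketch.
        return (tokens.last ?? 0) + 2
    }
}

/// The pipeline depends only on the decoding protocol, so either backend fits.
struct PipelineSketch {
    let decoder: any TextDecodingSketch

    func generate(from prompt: [Int], count: Int) -> [Int] {
        var tokens = prompt
        for _ in 0..<count {
            tokens.append(decoder.decodeNextToken(after: tokens))
        }
        return tokens
    }
}
```

If the initializer instead required a Core ML specific type, MLXDecoderSketch could not be passed in without a wrapper, which is the coupling the comment wants to avoid.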