argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon
https://takeargmax.com/blog/whisperkit
MIT License
3.17k stars, 267 forks

MLX text decoder #161

Closed: jkrukowski closed this 3 months ago

jkrukowski commented 3 months ago

In this PR:

Tested with the https://huggingface.co/jkrukowski/whisper-base-mlx-safetensors and https://huggingface.co/jkrukowski/whisper-tiny-mlx-safetensors models. Also tested pipelines that mix MLX and Core ML components, covering different encoder / decoder combinations.

Possible improvements: