argmaxinc / WhisperKit

On-device Speech Recognition for Apple Silicon
https://takeargmax.com/blog/whisperkit
MIT License
3.17k stars 267 forks source link

Regression Test Pipeline #120

Open Abhinay1997 opened 5 months ago

Abhinay1997 commented 5 months ago
Abhinay1997 commented 4 months ago

Code is still messy. Needs cleanup once the normalization starts working.

Abhinay1997 commented 4 months ago

Running tests locally. Adding more unit tests for the new normalization code.

Abhinay1997 commented 1 month ago

@atiorh made the changes except for the AudioEncoder latency stats. Need to add a callback for that. Discussing with Zack on this.