Gadersd / whisper-burn

A Rust implementation of OpenAI's Whisper model using the burn framework
MIT License
268 stars 33 forks source link

FR: sorta. Consider splitting out speech preprocessing into a separate crate #20

Open skewballfox opened 1 year ago

skewballfox commented 1 year ago

Hi. Would you consider splitting out the functionality for computing the Mel spectrogram(inside audio.rs) into a separate crate?

This would be useful for other speech-centric models. There are a few libraries for this with ndarray, such as mel-spec and mfcc-rust (contributor), but this is the first implementation I've seen for burn.