YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
BSD 2-Clause "Simplified" License
318 stars 25 forks source link

JAX-models #23

Open WhyAreYouJay opened 7 months ago

WhyAreYouJay commented 7 months ago

Is transcription with JAX-able models and subsequent labeling with whisper-at possible? Or does the transcription result need to be from whisper-at? Im wondering because of how fast whisper-jax is, I could use it to transcribe but use your model to label (speech, laughter ect.) afterward.