YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
BSD 2-Clause "Simplified" License
318 stars 25 forks source link

Use with a fine-tuned model #24

Open Ar770 opened 7 months ago

Ar770 commented 7 months ago

Thanks for the great code! Can you explain if and how it's possible to train a whisper-at model based on a fine-tuned whisper model? Maybe a more general question, if we have a whisper model (transformers version), what's the process for training a model? From what I can see, the training dataset isn't in the repo. Where can we find it?