Use Audioset20K for fine tuning

Hi,

Thank you for your nice work. I would like to ask why I get such a low mAP and AUC when using Audioset 20k dataset for fine-tuning in SSAST model?

For the dataset I took the preprocessing of resampling to 16kHz and the batch_size was set to 8 (12 in the original code) and the Epoch was also 25 rounds, but the mAP I got was only 0.004844 and the AUC was only 0.499981, which is unbelievable!

I wonder what your mAP and AUC are when fine-tuning with Audioset20K?

Looking forward to your reply, thanks!

result

YuanGongND / ssast

Use Audioset20K for fine tuning #38