facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

reproduce inference results #19

Open hanlin-lu opened 11 months ago

hanlin-lu commented 11 months ago

Hello! Thank you for the great work.

I am trying to reproduce the inference results. Simply running "inf.sh" with your released checkpoints, I get the following results: with pretrained.pth, I get 0.006102 mAP, which makes sense to me; with finetuned.pth, I get 0.3388 mAP, which is much lower than your reported mAP of 0.4729.

I think the only difference should be my copy of the AudioSet data.
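One way to test whether the AudioSet data explains the gap is to count how many clips from the official evaluation list are actually present locally. The following is only a minimal sketch, not part of the AudioMAE code. It assumes the standard AudioSet segments CSV layout (comment lines starting with "#", then YTID, start, end, labels) and, as a further assumption, that each local clip is stored as "<YTID>.wav" in a single directory. The function names and paths are hypothetical.

```python
import csv
from pathlib import Path


def read_segment_ids(csv_path):
    """Return the set of YouTube IDs listed in an AudioSet segments CSV."""
    ids = set()
    with open(csv_path, newline="") as f:
        # skipinitialspace lets the quoted label field after ", " parse correctly.
        for row in csv.reader(f, skipinitialspace=True):
            # Skip blank lines and the "#"-prefixed header/comment lines.
            if not row or row[0].startswith("#"):
                continue
            ids.add(row[0])
    return ids


def coverage(csv_path, audio_dir, ext=".wav"):
    """Compare the official clip list with the files found in audio_dir.

    Assumes (hypothetically) that every clip is saved as <YTID><ext>.
    Returns (number expected, number found, sorted list of missing IDs).
    """
    expected = read_segment_ids(csv_path)
    present = {p.stem for p in Path(audio_dir).glob(f"*{ext}")}
    missing = expected - present
    return len(expected), len(expected & present), sorted(missing)


# Example call with hypothetical paths:
# total, found, missing = coverage("eval_segments.csv", "audioset/eval")
# print(f"{found}/{total} eval clips present, {len(missing)} missing")
```

If a noticeable share of the evaluation clips is missing, or the local files follow a different naming scheme, the evaluated subset differs from the one behind the reported number, and the mAP values are not directly comparable.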

Could you please give me any hints about what might cause the much lower inference mAP?

Thanks a lot!