Cannot reproduce finetuning result on Audioset-20k

Thanks for the great work.

When I reproduce the finetuning result on Audioset-20k, the best I can get is 35.4 while the reported result is 37.0.

I use the example script in this repo.

In addition, I vary the value of the learning rate from [-1e-4, -1e-2] and the value of batch size from [4, 64], the best result remains 35.4.

Could you provide some ideas to have better results?

facebookresearch / AudioMAE