facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".
Other
504 stars 43 forks source link

where is 2M audioset data and pretrain_audioset2M.sh? #21

Open JHjang223 opened 11 months ago

JHjang223 commented 11 months ago

Thank you meta for your hard work on the audioMAE implementation. I want to train with 2M data, but in fact, audioset is only releasing features, so I couldn't get the data. I was finally able to get 20k data from another website. Where do I download the 2M data and I can't find pretrain_audioset2M.sh. Check please.

Gariscat commented 7 months ago

Same issue...... I checked the website and also only found the features instead of the original waveforms. How should we get the raw data or the raw data is not released at all?

Jingerjia commented 6 months ago

My stupid solution is: Download the html of the class, and you'll find each video has it's youtube-id, start time, end time, and labels. Then we can download every video we need by analyzing the html of the classes. Good luck!