Pretrained models - Githubissues

YuanGongND / psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

BSD 3-Clause "New" or "Revised" License

139 stars 16 forks source link

Hi Annalisa,

Thanks for your interest.

This is the one (without weight averaging) I have on my server. Note it is a model created by my experiment code, I did a code cleanup before I release the code, so it might not fit the released code. You could have a try.

In one of my new projects, I have a new implementation of the mean pooling model using the torchvision implementation. You can play with it at https://colab.research.google.com/github/YuanGongND/vocalsound/blob/main/colab/VocalSound.ipynb. Nevertheless, the model is not pretrained with AudioSet.

-Yuan

YuanGongND / psla

Pretrained models #7