DCASE-REPO / DESED_task

Domestic environment sound event detection task
MIT License
126 stars 66 forks source link

Pretrained embeddings #45

Closed popcornell closed 2 years ago

popcornell commented 2 years ago

This PR supersedes https://github.com/DCASE-REPO/DESED_task/pull/35 Adding AST and PANNs Audioset-based pretrained models. Right now supports e2e training. Also the script to extract the embedding to use them offline is included.

popcornell commented 2 years ago

@RonFrancesca @turpaultn ready for review. Right now the code supports both fine-tuning of the model and using pre-extracted embeddings

popcornell commented 2 years ago

Once the embeddings are in dropbox and we agree on how participants can download them. I ll add instructions in the readme

turpaultn commented 2 years ago

I have uploaded the embeddings to zenodo:

popcornell commented 2 years ago

I added the instructions on the README.md on how to run the baseline with pre-trained models. Plus the results for the best model.

popcornell commented 2 years ago

@RonFrancesca can you review ?

RonFrancesca commented 2 years ago

@popcornell overwall the code is great! Thanks for the work!

In particular:

popcornell commented 2 years ago

I can't see your comments on README

RonFrancesca commented 2 years ago

I can't see your comments on README

can you see them here?

popcornell commented 2 years ago

Nope I am sorry. Did you submit your review ?

RonFrancesca commented 2 years ago

Nope I am sorry. Did you submit your review ?

Now you should be able to see them, I guess

popcornell commented 2 years ago

I should ve addressed your comments

popcornell commented 2 years ago

Can we merge this @turpaultn @RonFrancesca