qiuqiangkong / audioset_tagging_cnn

MIT License
1.35k stars 255 forks source link

Get embedding not classification #51

Open newusersayshi opened 2 years ago

newusersayshi commented 2 years ago

Is there an implementation of this anywhere that can be used to ouput embeddings of audio using any of th epretrained models, rather than classifications, so we could use these to train our own classifiers (e.g random forests) using these embeddings? Similar to how you can easily get a 128 embedding using VGGish.

lbroad-tm commented 1 year ago

If you view the models or their outputs, there are embeddings available.