DTaoo / VGGish

An implementation of vggish in keras with tf backend
117 stars 38 forks source link

Could you please provide the details of the pre-trained models? #1

Open silhouettesia opened 6 years ago

silhouettesia commented 6 years ago

Did you train the models on the AudioSet for AED? How did you get the raw audio dataset? Could you explain how to train the model without the fc layers? Thanks!

silhouettesia commented 6 years ago

or you just turn the released tensorflow trained model into keras ?

DTaoo commented 6 years ago

Yes, I just simply turn the released TF trained model into Keras, without any training on the AudioSet.

xav12358 commented 5 years ago

@DTaoo Is the repo for train the Vggish model ? Or is it to use the pretrained VGGish model?

rola93 commented 5 years ago

I'm late for the party... but will answer it anyway, hope it's useful for someone else.

@xav12358 It is for both... you can initialize the model with random weights and retrain them from scratch, or just use the given weights, or even retrain from the given weights (aka fine tuning). This repo provides a Keras version of the VGGish model, and let you load it with the given weigths, then you can do whatever you want with it. The code for train/fine tune the model is not given but is not that difficult.

I'll try to upload one example later

@DTaoo shouldn't this issue be clossed?

Jacob-Stevens-Haas commented 4 years ago

It might be nice if the README.md had a link to where to find vggish_audioset_weights.h5 and vggish_audioset_weights_without_fc2.h5. The current link to TensorFlow's version points to a tensorflow checkpoint file for the weights, which can't be directly loaded into keras.