Thanks for your sharing!
Congratulations on the 1st in the ActivityNet competition! Could you please answer the following
2 questions.I did not find answers according to your notepaper(http://wanglimin.github.io/contests/XiongW_Anet16.pdf):
The model of rgb and optical flow is resnet and inceptionv3. I want to know what model you used for the audio cnn? alexnet?Train the audio cnn from scratch or finetune the model which pretrained on the imagenet
What is your sample rate when you construct your train dataset: for rgb cnn, 1fps? What about the optical flow cnn?
Thanks for your sharing! Congratulations on the 1st in the ActivityNet competition! Could you please answer the following 2 questions.I did not find answers according to your notepaper(http://wanglimin.github.io/contests/XiongW_Anet16.pdf):