Hi, Cadene,
Thank you very much for your code, it's very helpful. However, it seems that you use ResNet as a separate, fixed feature extractor instead of updating its weights while training the VQA model. Could I ask the reason for that? Intuitively, training the feature extractor and the VQA model together might give better results, since the training set is not small.
Thank you in advance.