a question about training with/out VGenome

Cadene / murel.bootstrap.pytorch

MUREL (CVPR 2019), a multimodal relational reasoning module for VQA

https://arxiv.org/abs/1902.09487

BSD 3-Clause "New" or "Revised" License

a question about training with/out VGenome #10

Open archanray opened 5 years ago

archanray commented 5 years ago

Hi, I trained using VGenome, but the overall performance on VQA went down. I used the supplied YAML file as the config. Does anything need to be changed?

Cadene commented 5 years ago

@LousyLory Thanks for your interest :)

Which accuracy did you obtain with and without VGenome?
On which benchmark (train/val, trainval/test-dev)?
Is it the accuracy or the open ended accuracy?

archanray commented 5 years ago

Hi @Cadene, thanks for the response!

The model trained with VGenome under-performs in each of the categories (yes/no, number, others) as well as overall.
Benchmark: test-dev.
Accuracy: MSCOCO open-ended.

The following are the values I got:

           without-VG   with-VG
yes/no     83.75        82.95
number     49.3         49.21
others     56.93        57.0
overall    67.11        66.8

I understand that these differences are within the margin of error, but the second model under-performs in each and every category of the VQA challenge on test-dev.
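
For reference, the per-category gaps can be computed directly from the numbers reported above. This is only a minimal sketch: the dictionary and variable names are illustrative and are not part of the MUREL code base.

```python
# Accuracies on test-dev as reported in this comment (percent).
# Keys and variable names are illustrative, not taken from the repository.
without_vg = {"yes/no": 83.75, "number": 49.3, "others": 56.93, "overall": 67.11}
with_vg = {"yes/no": 82.95, "number": 49.21, "others": 57.0, "overall": 66.8}

# Difference (with-VG minus without-VG), rounded to two decimals
# to hide floating-point noise.
deltas = {k: round(with_vg[k] - without_vg[k], 2) for k in without_vg}

for category, delta in deltas.items():
    print(f"{category:8s} {without_vg[category]:6.2f} {with_vg[category]:6.2f} {delta:+.2f}")
```

All four gaps are below one point in absolute value.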

Cadene commented 5 years ago

@LousyLory Thanks for this info.

I am sorry for the issue. We developed on PyTorch v0.5 and released a cleaned-up version of the code on PyTorch v1.x. I will make sure that our results can be reproduced easily on this version as well. I will keep you updated.

While I focus on this issue, you could use the pretrained models from https://github.com/Cadene/block.bootstrap.pytorch as a baseline if you need to.

archanray commented 5 years ago

Cool thanks!!