Closed jsg921019 closed 2 years ago
Hello, although I don't find any mistakes in your description, I notice that the mlm_val_loss in your implementation is higher than our version (0.86~0.87). Our paper has released the pre-training and VQAv2 fine-tuning hyperparameters (Tables 10 & 11). Please check these settings and wait for our code & checkpoint release.
Hello, I have implemented BridgeTower architecture according to the paper and this issue based on METER github.
However, I was not able to get the result that match the paper. Below is the validation epoch loss graph for BridgeTower(blue) and METER(orange), mlm and itm respectively.
The training graph for both models are similar, even the downstream results for VQAv2 are similar
This is how i implemented BridgeTower
Is there anything wrong or anything that I missed in my implementation?? Thanks in advance.