Closed XiaoyanLi1 closed 4 years ago
Hi @XiaoyanLi1 ,
I get 36.1 mAP for IOU=0.5, and my hyper-parameter setting is:
cam_batch_size = 32
cam_batch_size = 0.1
irn_batch_size = 16
irn_learning_rate = 0.1
Maybe with different random seed or different batch_size/learning_rate, we can approach the reported performance.
Hi @XiaoyanLi1,
You should adjust the learning rates according to the batch sizes. Thanks @djiajunustc for the comment! The number of random walks iterations is computed as 2^{exp_times}. Hence, setting exp_times=8 means 256 iterations. Check to_transition_matrix in misc/indexing.py.
Here's what I've just got with
And I'm also suspecting skimage.transform.rescale in voc12/dataloader.py has some glitches when using torchvision resnet. I'm gonna run a few tests about this and update the repository.
@jiwoon-ahn @djiajunustc Thanks a lot for your comments. I'll try these settings!
Hi, I tried to adopt pseudo label as GT and train an end-to-end Mask R-CNN. But I failed to get comparable results as yours. Could you provide some more details such as input size, learning rate, batch size, max iteration ?
Thanks!
Besides, whether 'train' set or 'train_aug ' set is supposed to be adopted?
@djiajunustc, I have just updated the code: Pillow resizing function is now used when loading data. Please try again with the same setting. The performance may be slightly different (<1.5%) every time you run the code depending on the quality of CAMs. Use 'train' for evaluating the quality of the pseudo labels as there does not exist any ground-truth of 'train_aug'.
@jiwoon-ahn Thanks! But we can also generate pseudo label on 'train_aug' to train the Mask R-CNN?
@djiajunustc, Yes, I actually trained Mask R-CNN with the pseudo labels on 'train_aug' for the reported results. I'll add that to README.md later.
Hi Jiwoon Ahn, Your paper is very good and I'm really interested in it. I've already tried your code, but I cannot achieve the same performace as the paper. Would you please help me figure out where the problem is?
The CAM models have similar performace, but there are performance gaps between IRN models in both task.
There may be two possible reasons for the gap.
Would you please point out the differences between my experiments and yours that may results in the gap? Thank you!