msracver / Relation-Networks-for-Object-Detection

Relation Networks for Object Detection
MIT License
1.09k stars 190 forks source link

Question about "nongt_dim" #35

Closed interstate50 closed 5 years ago

interstate50 commented 5 years ago

I don't know what the meaning of "nongt_dim" is in _extract_positionembedding and _attention_moduleembedding. If it means "the number of rois that are not ground truth", how to determine which roi is not ground truth during training. Maybe I misunderstand it. Could anyone help me?

lilichu commented 5 years ago

hi! do you know how 'nongt_dim' works? https://github.com/msracver/Relation-Networks-for-Object-Detection/blob/e83e911d828e3c86624ce0aeb8d742d5ee67d5ba/relation_rcnn/symbols/resnet_v1_101_rcnn_attention_1024_pairwise_position_multi_head_16_learn_nms.py#L85 for example, nongt_dim = 2000(rois per image), when train, there is 2 images per batch, so 2000*2 rois. it will get the relation between every rois and first 2000 rois. but the rois of 2th image also get the relation about rois of 1th image. it is unreasonable.