kjwkch commented 1 year ago

hello. I am studying 2DPASS with code. It seems that the modality fusion implementation is in network/arch_2dpass.py from line 100 to line 105. In the thesis, bitwise add is specified, but I do not see it in the code, so I ask a question. Below is the code.

modality fusion

feat_learner = F.relu(self.leanersidx) feat_cat = torch.cat([img_feat, feat_learner], 1) feat_cat = self.fcs1idx feat_weight = torch.sigmoid(self.fcs2idx) fuse_feat = F.relu(feat_cat * feat_weight)

I think that [fuse_feat = F.relu(feat_cat*feat_wieght) + img_feat] implements the formula in the paper as a code. Isn't it?

brahami14 commented 1 year ago

i have the same question

thanks in advance

LiXiang0021 commented 1 year ago

Have you guys successively reproduced the model on Nuscenes, I did several experiments but the performance is far away from the report results. And, I also tested the provided weight on Nuscences getting results similar to the reported results. I'd like to know if I forgot to set some arguments.

kjwkch commented 1 year ago

This issue is code implementation, not performance. I think open code is not implemented as described in the paper.

kjwkch commented 1 year ago

Missing in the code.

modality fusion

feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight)

I think that [fuse_feat = F.relu(feat_cat*feat_wieght) + img_feat] implements the formula in the paper as a code.

feat_learner = F.relu(self.leaners[idx](pts_feat))
feat_cat = torch.cat([img_feat, feat_learner], 1)
feat_cat = self.fcs1[idx](feat_cat)
feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
fuse_feat = F.relu(feat_cat * feat_weight) + img_feat

LiXiang0021 commented 1 year ago

Thanks for your reply, I will further check this issue.

LiXiang0021 commented 1 year ago

I just trained the modified version as you said, and the performance did improve a little bit around 2 on mIoU. I believe there may be some other wrong implements or missing in the released code. And thank you again.

jaywu109 commented 1 year ago

@kjwkch @LiXiang0021 After reviewing the current implementation, I noticed that besides the fusion modification, the point feature pass through the 2D learner needs to be added to the original point feature before passing through multihead_3d_classifier, in order to match the model architecture outlined in the paper as below:

        feat_learner = F.relu(self.leaners[idx](pts_feat)) 
        # feat_learner -> voxel-wise feature after 2D learner

        pts_pred_full = self.multihead_3d_classifier[idx]((pts_feat+feat_learner)) 
        # pts_feat+feat_learner -> voxel-wise Enhanced 3D Features

        # correspondence
        pts_label_full = self.voxelize_labels(data_dict['labels'], data_dict['layer_{}'.format(idx)]['full_coors'])
        pts_pred = self.p2img_mapping(pts_pred_full[coors_inv], point2img_index, batch_idx)

        # modality fusion

        feat_learner = self.p2img_mapping(feat_learner[coors_inv], point2img_index, batch_idx)
        # feat_learner -> point-wise feature after 2D learner and img_mapping

        feat_cat = torch.cat([img_feat, feat_learner], 1)
        feat_cat = self.fcs1[idx](feat_cat)
        feat_weight = torch.sigmoid(self.fcs2[idx](feat_cat))
        fuse_feat = F.relu(feat_cat * feat_weight) + img_feat

Currently, the implementation takes the point feature as input directly for multihead_3d_classifier instead of adding the point feature after the 2D learner. https://github.com/yanx27/2DPASS/blob/80b8646bbb0dd46ddcee31f531d6f1c45baf35fe/network/arch_2dpass.py#L89-L93

@yanx27, I would appreciate any suggestions you may have regarding this matter.

brahami14 commented 1 year ago

@jaywu109 does the new script changes work for you ?

yanx27 / 2DPASS

Modality fusion implementation question #53

modality fusion

modality fusion