Open study1994 opened 2 years ago
Hi @study1994 Here are some suggestions:
GlobalRotScaleTrans
and RandomFlip3D
when you train the LC model. These augmentations will bring some mismatch for the image-guided query initialization (although does not hurt the final performance on nuScenes) and I have already commented it out for TransFusionLC in the last commit. You can find some detailed discussion here
hillo,I trained the pandaset dataset with your code。The training went well with transfusion_pillar_L and training log see link:https://pan.baidu.com/s/1PFjYIedIrYY1qKw5SwUe1A?pwd=4cho Extraction code:4cho but bad with transfusion_pillar_LC and training log see link:https://pan.baidu.com/s/10en2ppjXWTYlhE_i4wyXqw?pwd=zxbb Extraction code:zxbb The performance of the model transfusion_pillar_LC trained on transfusion_pillar_L basis is degraded。What I'm confused about is that loss_heatmap fall first and then rise and the last matched_ious should be the highest。the MAP of the final test on data is lower than transfusion_pillar_L。 there is no problem in data validation,what do you think may be the wrong place and Can you give me some advice?