Atten4Vis / MS-DETR

[CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"
Apache License 2.0

about DINO+MS-DETR #15

Open ms12 opened 1 week ago

ms12 commented 1 week ago

I tried adding MS-DETR to DINO and found that performance dropped considerably compared to the baseline without it. It seems that the one-to-many (o2m) supervision is interfering with the one-to-one (o2o) supervision. Do the authors have any idea what might be causing this?

ZhaoChuyang commented 6 days ago

Yes, we also found that o2m supervision does not seem to be compatible with the original DINO repo. We incorporated the contrastive denoising (CDN) queries from DINO into the Deformable-DETR code, and that setup works with o2m supervision. I think the cause may lie in the loss weights and the way the loss is averaged, which differ between DINO and Deformable-DETR, but we did not explore this further.
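To make the point about loss averaging concrete, here is a minimal sketch of how the choice of normalizer alone can change the balance between the o2o and o2m terms. It is not taken from the DINO, Deformable-DETR, or MS-DETR code. The function names, the `o2m_norm` options, and the numbers are all hypothetical, and it makes no claim about which normalizer either repo actually uses.

```python
# Hypothetical illustration only: none of these names come from the
# DINO, Deformable-DETR, or MS-DETR repositories.

def normalized_loss(per_query_losses, normalizer):
    """Sum the per-query losses and divide by the chosen normalizer."""
    return sum(per_query_losses) / max(normalizer, 1)


def combined_loss(o2o_losses, o2m_losses, num_gt,
                  w_o2o=1.0, w_o2m=1.0, o2m_norm="num_gt"):
    """Combine o2o and o2m terms under an assumed normalization scheme.

    o2m_norm="num_gt":  divide the o2m sum by the number of ground-truth boxes.
    o2m_norm="num_pos": divide the o2m sum by the number of positive queries.
    """
    o2o = normalized_loss(o2o_losses, num_gt)
    if o2m_norm == "num_gt":
        o2m = normalized_loss(o2m_losses, num_gt)
    elif o2m_norm == "num_pos":
        o2m = normalized_loss(o2m_losses, len(o2m_losses))
    else:
        raise ValueError(f"unknown o2m_norm: {o2m_norm}")
    return w_o2o * o2o + w_o2m * o2m, o2o, o2m


# Toy setup: 2 ground-truth boxes, one o2o match each,
# and 3 o2m positives per box, all with the same per-query loss.
num_gt = 2
o2o_losses = [0.5] * num_gt
o2m_losses = [0.5] * (3 * num_gt)

total_gt, o2o_gt, o2m_gt = combined_loss(
    o2o_losses, o2m_losses, num_gt, o2m_norm="num_gt")
total_pos, o2o_pos, o2m_pos = combined_loss(
    o2o_losses, o2m_losses, num_gt, o2m_norm="num_pos")

print("num_gt normalizer :", o2o_gt, o2m_gt, total_gt)
print("num_pos normalizer:", o2o_pos, o2m_pos, total_pos)
```

In this toy setup, dividing the o2m sum by the number of ground-truth boxes makes the o2m term three times larger than the o2o term at the same loss weight, whereas dividing by the number of positives keeps the two equal. If one codebase normalizes differently from the other, loss weights tuned for one may need rescaling before they transfer.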