IDEA-Research / DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Apache License 2.0

What is the content query of CDN queries? #8

Closed zengwang430521 closed 2 years ago

zengwang430521 commented 2 years ago

Hi, I'm confused about what the content query of CDN queries is. Is it the same as the normal learnable content queries?

Looking forward to your reply. Thanks in advance!

HaoZhang534 commented 2 years ago

The content parts of CDN queries are ground-truth label embeddings with noise added, the same as in DN-DETR.
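
As a rough illustration of the reply above, the sketch below builds content queries for one denoising group by randomly flipping some ground-truth labels and then looking each label up in an embedding table. The function name `noised_label_queries`, the `flip_prob` parameter, and the use of plain Python lists in place of tensors are assumptions made for illustration; this is not the DINO or DN-DETR code.

```python
import random


def noised_label_queries(gt_labels, embed_table, flip_prob, rng):
    """Return (noised_labels, content_queries) for one denoising group.

    gt_labels:   list of ground-truth class ids
    embed_table: list of per-class embedding vectors (hypothetical stand-in
                 for a learnable label embedding)
    flip_prob:   probability of replacing a label with a random class id
    rng:         random.Random instance, passed in for reproducibility
    """
    num_classes = len(embed_table)
    noised = []
    for label in gt_labels:
        if rng.random() < flip_prob:
            # Label noise: swap in a uniformly sampled class id
            # (which may happen to equal the original label).
            label = rng.randrange(num_classes)
        noised.append(label)
    # The content query is the embedding of the (possibly flipped) label.
    queries = [list(embed_table[label]) for label in noised]
    return noised, queries


rng = random.Random(0)
embed_table = [[float(c), float(c) + 0.5] for c in range(4)]  # 4 classes, dim 2
labels, queries = noised_label_queries([1, 3, 3], embed_table, flip_prob=0.5, rng=rng)
```

In this sketch the matching (non-denoising) queries are not built at all; only the denoising content part described in the reply is shown.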

zengwang430521 commented 2 years ago

Thanks for your reply.

owen24819 commented 1 year ago

Hi, following up on this question:

I see that an indicator was used in DN-DETR but dropped in DINO. Below I quote the description of the indicator from the DN-DETR paper.

"We also append an indicator to label embedding. The indicator is 1 if a query belongs to the denoising part and 0 otherwise."

So it was essentially used as the last dimension of the content query, telling DETR whether the query belongs to the denoising part or not.
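
A minimal sketch of that reading of the quoted sentence, assuming the indicator is simply concatenated as one extra scalar at the end of the label embedding. The helper `append_indicator` is hypothetical, and plain lists stand in for tensors; the actual DN-DETR code may lay this out differently.

```python
def append_indicator(label_embedding, is_denoising):
    """Append a 1.0/0.0 flag as the last dimension of a content query."""
    flag = 1.0 if is_denoising else 0.0
    return list(label_embedding) + [flag]


# A denoising query carries the flag 1.0, a matching query carries 0.0.
dn_query = append_indicator([0.2, -0.7, 1.1], is_denoising=True)
matching_query = append_indicator([0.0, 0.0, 0.0], is_denoising=False)
```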

Can you comment on why you dropped the indicator? I couldn't find any mention of this in the paper. Also, I could not find an ablation study in DN-DETR showing that the indicator itself helped.

For context, I am trying to implement DN, and potentially CDN, in a DETR derivative.

Also very nice work!