Detail about object detection decoder.

Hi, there. I believe GLEE is a great work, thanks for open source! I have a question about object detection: what's the input to the decoder when used as a object detector? Does it need to input object query including box position from anchor boxes? If I'm not wrong, in MaskDINO, it will input box position from anchor and mask as object query. So, what's the object query like in GLEE when used as object detector? Looking forward for your reply, thanks a lot!

FoundationVision / GLEE

Detail about object detection decoder. #16