fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Apache License 2.0
3.14k stars 513 forks source link

Format of groundtruth bounding box format #230

Open zhi-dian opened 4 months ago

zhi-dian commented 4 months ago

Thank you for your awesome work! I'm currently using def detr for customized dataset and get a bit confused on the format of target bbox format. According to loss function, the format should be cxcyhw. image But the getitem method of coco dataset is xyxy. image Does the format matter given it is an end-to-end set prediction? And if I want to use pretrained weight on coco dataset, which format should I use?