IDEA-Research / DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Apache License 2.0
2.08k stars 228 forks source link

Experiment on Pascal VOC #264

Open JiaPiangLi opened 3 days ago

JiaPiangLi commented 3 days ago

I have tried to train DINO-4Scale-R50 on Pascal VOC Dataset and only load the pretrain model of resnet 50 from torchvision. The AP0.5 is 0.585 at epoch 30, which is much lower than the benchmark of VOC. Does that mean the scale of VOC is still too small for DINO training or I have missed some other issues? My configurations are as following. GPU: RTX 3090 Batchsize: 4 lr: 1e-4 lr_backbone: 1e-5