Closed helq2612 closed 2 years ago
@helq2612 Hi, we have not paid much attention to tuning these hyper-parameters and have not tried different learning rate settings.
Thank you!
This issue is not active for a long time and it will be closed in 5 days. Feel free to re-open it if you have further concerns.
Hi ,
I have one question about the batch size and learning rate used in Anchor Detr. From the paper, the batch size = 1x8=8, and the lr =1e-4 (for backbone it is 1e-5). But comparing to Detr or Conditional Detr, they are using batch size = 2 x 8 = 16, and with the same learning rate settings as yours.
From Detr's discussion, https://github.com/facebookresearch/detr/issues/48#issuecomment-638689380, that author provides two options:
Have you tried with different learning rate settings?
Thank you!