Increasing batch size negatively impacts mAP, is it because of padding ?

microsoft / FocalNet

[NeurIPS 2022] Official code for "Focal Modulation Networks"

MIT License

682 stars 61 forks source link

Hello, I have noticed that running evaluations with batch size > 1 leads to much lower mAP, so I was wondering if the reason is because the model (large fl4 with 5scale DINO) was trained with only 1 image per GPU ? It is not specified in focal-dino's README and I would like to make sure this is indeed the reason.

And as an additional question, does someone know why increasing the batch size does not improve the inference speed / image ? I just know that it is not because of focalnet backbone, because I have observed the same effect with resnet50 and swin backbones.

microsoft / FocalNet

Increasing batch size negatively impacts mAP, is it because of padding ? #47