Hi @zylo117, first I'd like to thank you for providing such a convenient training flow using your repo.
I'm working with a foggy image dataset that contains 2 classes (car & person) with 1000 training images and 200 validation images (image size 2048 x 1280). Since I only use Tesla P100 16GB (Colab) for the training, I don't have a big chance to use d5 for high res input size.
After reading all issues and your suggestions on this repo, I chose D0 and forced the input size to 1280. The mAP and inference result was pretty good (in my opinion). But when I look at the loss curves especially the regression loss, it's clear that overfitting happened even from the early steps.
This is how the regression loss curves looks like:
This is the coco_eval result:
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.578
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.839
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.629
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.058
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.473
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.750
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.188
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.626
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.654
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.058
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.584
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.805
This is the inference result:
I also calculated the anchors_ratio using this repo as you suggested.
Hi @zylo117, first I'd like to thank you for providing such a convenient training flow using your repo.
I'm working with a foggy image dataset that contains 2 classes (car & person) with 1000 training images and 200 validation images (image size 2048 x 1280). Since I only use Tesla P100 16GB (Colab) for the training, I don't have a big chance to use d5 for high res input size.
After reading all issues and your suggestions on this repo, I chose D0 and forced the input size to 1280. The mAP and inference result was pretty good (in my opinion). But when I look at the loss curves especially the regression loss, it's clear that overfitting happened even from the early steps.
This is how the regression loss curves looks like:
This is the
coco_eval
result:This is the inference result:
I also calculated the anchors_ratio using this repo as you suggested.
I'm new to the object detection field. So, I'd really appreciate your input about how can I fix the overfitting regression loss. Thanks in advance.
Note: I haven't applied any augmentation method, would it be one of the possible solutions?