Closed sarmientoj24 closed 2 years ago
@sarmientoj24 sorry, I only use v5's default parameters during training. if backbone not use pretrained weights, I think need more epoch
also do you have some idea why the AP and recall for medium are zero when using swin transformer?
YOLO
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.640
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.778
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.751
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.700
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.640
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.750
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.839
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.839
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.700
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.839
Epoch: 19 | mAP@0.5: 0.7780083973202115 | mAP@0.50:0.95: 0.6397958600001571
YOLO + SWIN
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.449
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.625
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.547
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.449
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.590
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.719
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.720
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.000
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.721
Epoch: 19 | mAP@0.5: 0.625379201780706 | mAP@0.50:0.95: 0.4491794728277074
@Bobo-y i was able to use SwinV2 with YOLOv6 and have it on par. however, do you have some idea why it could be that the precision and recall are so low for area=medium
?
swinv2 - yolov6n
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.587
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.749
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.702
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.008 <-------------------- really low
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.587
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.732
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.801
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.801
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.100 <-------------------- really low
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.802
yolov6n
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.660
Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.793
Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.759
Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.700
Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.661
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.795
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.842
Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.842
Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = -1.000
Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.700 <-------------------- high
Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.842
not researched, i don‘t know
I was successful in plugging in your Swin Transformer in YOLOv6 by meituan. Compared it with YOLOv6n using Tiny-Swin T + FPN + YOLOv6's neck and Detect layers.
While I am getting results, it is about 0.15 to 0.2 lower in mAP on the experiment when I ran it for 20 epochs for now.
Any tips on this such as: