Anchors scaling for each feature map output

ayooshkathuria / YOLO_v3_tutorial_from_scratch

Accompanying code for Paperspace tutorial series "How to Implement YOLO v3 Object Detector from Scratch"


https://github.com/ayooshkathuria/YOLO_v3_tutorial_from_scratch/blob/8264dfba39a866998b8936a24133f41f12bfbdb7/util.py#L59

I have a question. YOLOv3 already defines anchors for all three scales (as mentioned in the paper), so why do we need to downscale the anchors again for each scale? I find the anchor scaling a bit hard to understand, as I am new to anchor-based detection.

Hi, let me try to answer. The anchor box sizes we supply are expressed in pixels of the original (network input) image, but the box sizes the network predicts are expressed in units of feature-map cells. So we need to scale the anchors by a coefficient, namely divide them by the stride of that feature map (input size / grid size), so that they correspond to feature maps of different sizes.
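To make this concrete, here is a minimal sketch of the idea in plain Python. It is not the repository's code: the function name `scale_anchors` is made up for illustration, and I am assuming a 416x416 input, so the coarsest feature map is 13x13 (stride 32) and uses the three largest anchors from the standard YOLOv3 config.

```python
import math

def scale_anchors(anchors, inp_dim, grid_size):
    """Convert anchors from input-image pixels to feature-map cell units.

    `scale_anchors` is a hypothetical helper, not a function from the repo.
    """
    stride = inp_dim // grid_size  # input pixels covered by one grid cell
    return [(w / stride, h / stride) for (w, h) in anchors]

# Anchors for the coarsest scale, in pixels of the 416x416 input (assumed size).
anchors_px = [(116, 90), (156, 198), (373, 326)]

# The 13x13 feature map has stride 416 // 13 = 32.
scaled = scale_anchors(anchors_px, inp_dim=416, grid_size=13)
print(scaled)

# Decoding a predicted width: b_w = p_w * exp(t_w), computed in cell units,
# then multiplied by the stride to get back to input-image pixels.
t_w = 0.0                       # a raw network output of 0 keeps the anchor width
stride = 416 // 13
width_cells = scaled[0][0] * math.exp(t_w)
width_px = width_cells * stride
print(width_cells, width_px)
```

The same anchor list would give different cell-unit sizes on the 26x26 and 52x52 maps (strides 16 and 8), which is why the scaling is repeated for every output scale.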

Anchors scaling for each feature map output #70