The init function in encoder.py first sets the anchor area for each feature map level (P3 to P7):
self.anchor_areas = [32 * 32., 64 * 64., 128 * 128., 256 * 256., 512 * 512.]
and then combines these areas with the anchor locations.
I think the anchor areas should be adjusted to the actual object sizes, especially when the input image is small. Because we encode the boxes in advance, the anchor areas need to be chosen with care.
Is this understanding right?
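To make the question concrete, here is a rough sketch of the kind of adjustment I have in mind. It is not code from the repo: `scale_anchor_areas`, `anchor_sides`, and the `reference_size` of 600 are my own hypothetical names and values, and the idea of scaling areas by the squared ratio of input sizes is just one possible way to adapt the defaults.

```python
import math

# Per-level anchor areas quoted above (P3 to P7).
DEFAULT_ANCHOR_AREAS = [32 * 32., 64 * 64., 128 * 128., 256 * 256., 512 * 512.]


def scale_anchor_areas(input_size, reference_size=600.0,
                       base_areas=DEFAULT_ANCHOR_AREAS):
    """Scale each anchor area by (input_size / reference_size) ** 2.

    reference_size is an assumed side length that the default areas
    suit; it is a hypothetical value, not taken from the repo.
    """
    ratio = input_size / reference_size
    return [area * ratio ** 2 for area in base_areas]


def anchor_sides(areas):
    """Side length of a square anchor for each area."""
    return [math.sqrt(area) for area in areas]


# Halving the input side length halves each anchor side.
small_areas = scale_anchor_areas(input_size=300)
print(anchor_sides(small_areas))
```

With an input of 300 and the assumed reference of 600, the anchor sides shrink from 32..512 to 16..256. Another option would be to derive the areas from the box sizes in the training annotations instead of from the input size.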