Box_Mask question

I think this code is not necessary if you follow #23. You can remove all of these default numbers and just initialize the arrays to zeros:

    _boxes = np.zeros([hw, num_anchors, 4], dtype=float)
    _box_mask = np.zeros([hw, num_anchors, 1], dtype=float)
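As a rough illustration of why zero defaults are harmless, here is a minimal sketch. The helper name `init_box_targets`, the 13x13 grid, the 5 anchors, and the masked squared-error loss are all assumptions made for the example, not code from the repository.

```python
import numpy as np


def init_box_targets(hw, num_anchors):
    """Return zero-initialized box targets and their mask (hypothetical helper)."""
    boxes = np.zeros([hw, num_anchors, 4], dtype=float)
    box_mask = np.zeros([hw, num_anchors, 1], dtype=float)
    return boxes, box_mask


# Assumed sizes for illustration: a 13x13 output grid with 5 anchors per cell.
hw, num_anchors = 13 * 13, 5
boxes, box_mask = init_box_targets(hw, num_anchors)

# Stand-in for network predictions.
pred = np.random.rand(hw, num_anchors, 4)

# With an all-zero mask, anchors that were never matched to a ground-truth box
# contribute nothing to a masked squared-error box loss.
masked_loss = np.sum(box_mask * (pred - boxes) ** 2)
```

Only anchors whose mask entry is later set to a nonzero value would add to the box loss in this setup.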

In my experiments (with or without these default values), the trained model's mAP is almost the same.

I think these numbers come from the scale used with delta_region_box: coord_scale * (2 - truth.w*truth.h)
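To make the expression concrete, here is a small sketch that evaluates it for two box sizes. The function name `coord_weight`, the default `coord_scale=1.0`, and the assumption that the width and height are fractions of the image size (0 to 1) are mine, for illustration only.

```python
def coord_weight(truth_w, truth_h, coord_scale=1.0):
    """Evaluate coord_scale * (2 - truth.w * truth.h) for one ground-truth box.

    truth_w and truth_h are assumed to be normalized to the image size (0..1).
    coord_scale=1.0 is an assumed default, not a value taken from the thread.
    """
    return coord_scale * (2.0 - truth_w * truth_h)


# A small box gets a weight close to 2; a box covering the whole image gets 1.
small = coord_weight(0.1, 0.1)
full = coord_weight(1.0, 1.0)
```

Under these assumptions the weight falls from about 2 toward 1 as the box area grows, so small boxes count more in the coordinate loss.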

I'm not really sure why YOLO's author did it. net.seen is the number of images the network has seen so far during training. 12800 is roughly 0.8 of an epoch (VOC07+12 has 16,551 images). My guess is that it's a kind of curriculum learning.
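The arithmetic can be checked with a short sketch. The helper names are hypothetical, the dataset size is taken as the 5,011 VOC2007 plus 11,540 VOC2012 trainval images, and treating the threshold as a plain `seen < 12800` check is an assumption about how it is applied.

```python
def epoch_fraction(images_seen, dataset_size):
    """Fraction of one pass over the dataset covered by images_seen."""
    return images_seen / dataset_size


def in_warmup(seen, warmup_images=12800):
    """True while fewer than warmup_images images have been seen (assumed check)."""
    return seen < warmup_images


# VOC2007 trainval (5,011) + VOC2012 trainval (11,540) images.
voc_images = 5011 + 11540

# 12800 images is a bit more than three quarters of one epoch.
frac = epoch_fraction(12800, voc_images)
```

So, under these assumptions, the default values would only be active for less than the first epoch of training.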

longcw / yolo2-pytorch

Box_Mask question #21