Open zlmlaker opened 7 years ago
I've read the code and agree with you. According to YOLO9000 paper, "Multi-Scale Training" might be able to mitigate this problem. Introducing some kind of randomness in the ground truth bbox selection might help too.
hi, I found this bug yesterday, it seems the author just simply copy the code from yolo1. Anyone solved this problem?
The _batch func in data.py seems to neglect some cases where there are more than one objects centered in the same cell. The produced label only takes the last object into consideration. So the network only cares about one image in the each cell during training. It seems a bug to me because in darknet each gt-box is treated independently even if there are more than one object in a cell.