Open AlexeyAB opened 4 years ago
@AlexeyAB
Hello, 273 epochs is not sufficient for training from scratch of panet. From the log file, I think it not yet converge.
In rethinking imagenet pre-training, they also shows that training form scratch need more epochs.
So +-0.7 AP can be just fluctuation on early stages of training. Or partial-residual connections require more iterations for training than common-residual connections?
I think both yes.
@WongKinYiu Hi,
Do you know why just one small change (1 line) greatly improves accuracy
+ 0.7
on Pytorch-yolo? https://github.com/WongKinYiu/CrossStagePartialNetworks/blob/pytorch/README.mdFull diff: