Do you know why just one small change (1 line) greatly improves accuracy +0.7 on Pytorch-yolo?

WongKinYiu / CrossStagePartialNetworks

Cross Stage Partial Networks

https://github.com/WongKinYiu/CrossStagePartialNetworks

894 stars 172 forks source link

Do you know why just one small change (1 line) greatly improves accuracy +0.7 on Pytorch-yolo? #9

Open AlexeyAB opened 4 years ago

AlexeyAB commented 4 years ago

@WongKinYiu Hi,

Do you know why just one small change (1 line) greatly improves accuracy + 0.7 on Pytorch-yolo? https://github.com/WongKinYiu/CrossStagePartialNetworks/blob/pytorch/README.md

Model	Size	NMS	1080ti fps	BFLOPs	AP	AP50	AP75	cfg	weight
CSPResNeXt50-PANet-SPP	512×512	0.5	44	71.331	39.2	59.5	41.8	cfg	-
CSPResNeXt50c-PANet-SPP	512×512	0.5	-	71.734	39.9	60.1	42.6	cfg	-

Full diff:

WongKinYiu commented 4 years ago

@AlexeyAB

Hello, 273 epochs is not sufficient for training from scratch of panet. From the log file, I think it not yet converge.

In rethinking imagenet pre-training, they also shows that training form scratch need more epochs.

AlexeyAB commented 4 years ago

So +-0.7 AP can be just fluctuation on early stages of training. Or partial-residual connections require more iterations for training than common-residual connections?

WongKinYiu commented 4 years ago

I think both yes.