jwyang / fpn.pytorch

Pytorch implementation of Feature Pyramid Network (FPN) for Object Detection
MIT License
952 stars 221 forks

rcnn_box loss equals 0 when training #17

Open maozezhong opened 6 years ago

maozezhong commented 6 years ago

@jwyang Hello, I trained FPN with a ResNeXt backbone on my own dataset, and the rcnn_box loss stays at zero throughout training. Is something wrong? Meanwhile, the total loss fluctuates slightly above 0.2. Below is some of the output:

[session 1][epoch  1][iter 20200] loss: 0.2516, lr: 1.00e-03
            fg/bg=(2/254), time cost: 75.063063
            rpn_cls: 0.0987, rpn_box: 0.0070, rcnn_cls: 0.0789, rcnn_box 0.0000
[session 1][epoch  1][iter 20300] loss: 0.2348, lr: 1.00e-03
            fg/bg=(4/252), time cost: 71.644270
            rpn_cls: 0.1217, rpn_box: 0.0084, rcnn_cls: 0.1442, rcnn_box 0.0000
[session 1][epoch  1][iter 20400] loss: 0.2335, lr: 1.00e-03
            fg/bg=(2/254), time cost: 67.127675
            rpn_cls: 0.0929, rpn_box: 0.0019, rcnn_cls: 0.0792, rcnn_box 0.0000
[session 1][epoch  1][iter 20500] loss: 0.2261, lr: 1.00e-03
            fg/bg=(3/253), time cost: 66.649418
            rpn_cls: 0.1408, rpn_box: 0.0142, rcnn_cls: 0.1100, rcnn_box 0.0000
[session 1][epoch  1][iter 20600] loss: 0.2505, lr: 1.00e-03
            fg/bg=(4/252), time cost: 67.386812
            rpn_cls: 0.1469, rpn_box: 0.0233, rcnn_cls: 0.1441, rcnn_box 0.0000
[session 1][epoch  1][iter 20700] loss: 0.2449, lr: 1.00e-03
            fg/bg=(4/252), time cost: 67.287636
            rpn_cls: 0.1688, rpn_box: 0.0152, rcnn_cls: 0.1434, rcnn_box 0.0000
[session 1][epoch  1][iter 20800] loss: 0.2117, lr: 1.00e-03
            fg/bg=(2/254), time cost: 66.637634
            rpn_cls: 0.0515, rpn_box: 0.0037, rcnn_cls: 0.0790, rcnn_box 0.0000
[session 1][epoch  1][iter 20900] loss: 0.2153, lr: 1.00e-03
            fg/bg=(2/254), time cost: 68.067170
            rpn_cls: 0.0550, rpn_box: 0.0020, rcnn_cls: 0.0788, rcnn_box 0.0000
[session 1][epoch  1][iter 21000] loss: 0.2342, lr: 1.00e-03
            fg/bg=(4/252), time cost: 67.029715
            rpn_cls: 0.1416, rpn_box: 0.0063, rcnn_cls: 0.1458, rcnn_box 0.0000
[session 1][epoch  1][iter 21100] loss: 0.2384, lr: 1.00e-03
            fg/bg=(4/252), time cost: 67.087833
            rpn_cls: 0.1267, rpn_box: 0.0053, rcnn_cls: 0.1444, rcnn_box 0.0000
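
For anyone comparing their own run against the log above, here is a minimal sketch that parses records in this format and reports the foreground fraction and rcnn_box value per record. The regex is inferred only from the output pasted here, and the names `RECORD`, `summarize`, and `SAMPLE_LOG` are hypothetical helpers, not part of fpn.pytorch. It only summarizes the numbers; it does not identify the cause of the zero loss.

```python
import re

# Record layout inferred from the pasted output (an assumption, not taken
# from the repository's logging code): an "[iter N] loss: X" line, then a
# "fg/bg=(F/B)" line, then a line ending in "rcnn_box V".
RECORD = re.compile(
    r"\[iter\s+(?P<iter>\d+)\]\s+loss:\s+(?P<loss>[\d.]+).*?"
    r"fg/bg=\((?P<fg>\d+)/(?P<bg>\d+)\).*?"
    r"rcnn_box:?\s+(?P<rcnn_box>[\d.]+)",
    re.DOTALL,
)


def summarize(log_text):
    """Return one dict per log record with the fields relevant to this issue."""
    rows = []
    for m in RECORD.finditer(log_text):
        fg, bg = int(m["fg"]), int(m["bg"])
        rows.append({
            "iter": int(m["iter"]),
            "loss": float(m["loss"]),
            # Share of sampled ROIs that were labelled foreground.
            "fg_fraction": fg / (fg + bg),
            "rcnn_box": float(m["rcnn_box"]),
        })
    return rows


# Two records copied from the log above.
SAMPLE_LOG = """\
[session 1][epoch  1][iter 20200] loss: 0.2516, lr: 1.00e-03
            fg/bg=(2/254), time cost: 75.063063
            rpn_cls: 0.0987, rpn_box: 0.0070, rcnn_cls: 0.0789, rcnn_box 0.0000
[session 1][epoch  1][iter 20300] loss: 0.2348, lr: 1.00e-03
            fg/bg=(4/252), time cost: 71.644270
            rpn_cls: 0.1217, rpn_box: 0.0084, rcnn_cls: 0.1442, rcnn_box 0.0000
"""

rows = summarize(SAMPLE_LOG)
for row in rows:
    print(row)
print("rcnn_box is zero in every record:", all(r["rcnn_box"] == 0.0 for r in rows))
```

In the records posted here, only 2 to 4 of the 256 sampled ROIs are foreground, and rcnn_box prints as 0.0000 in every one of them.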

suchi007 commented 5 years ago

Same error. How do I fix it?

JingXiaolun commented 5 years ago

@maozezhong, can you tell me the corresponding solution? I have been confused about it for a few days. Thanks a lot.

greyring commented 5 years ago

I ran into the same problem.

CharlesPikachu commented 4 years ago

There are some bugs in this repo. Maybe you could try this: https://github.com/DetectionBLWX/FPN.pytorch, which is much easier to understand and inherits the properties of this repo.

HZNUJeffreyRen commented 4 years ago

@maozezhong Hello, I have the same problem. Can you help me?