NVlabs / PWC-Net

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume, CVPR 2018 (Oral)

Single loss is better? #100

Open JohnnieXDU opened 4 years ago

JohnnieXDU commented 4 years ago

Hi, I am trying to train PWC-Net from scratch on FlyingChairs. However, during training I found that: 1) the model converges faster if only the loss from the bottom of the pyramid is used. That is, training with a single photometric loss computed at level 2 seems to perform better than training with the losses from levels 2/3/4/5/6 weighted as described in the original paper.

2) When I train the model with the losses from levels 2, 3, 4, 5, and 6 and their weights (as described in the paper), the model converges extremely slowly. What I have done is to sum these weighted losses and then backpropagate, as in the sketch below. Am I doing this right? Why does the training loss keep going up and down? The curve is shown in the screenshot below.
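For concreteness, this is roughly what I am doing (a minimal sketch of my own training code, not code from this repo; `photometric_loss` and `flow_preds` are placeholders, and the per-level weights are the α values from the paper):

```python
# Per-level weights from the paper (level 2 is the finest output):
# alpha_6 = 0.32, alpha_5 = 0.08, alpha_4 = 0.02, alpha_3 = 0.01, alpha_2 = 0.005
LEVEL_WEIGHTS = {6: 0.32, 5: 0.08, 4: 0.02, 3: 0.01, 2: 0.005}

def total_loss(flow_preds, img1, img2, photometric_loss):
    """Sum the weighted per-level losses, then run a single backward pass.

    flow_preds       : dict mapping pyramid level -> predicted flow tensor
    photometric_loss : callable (flow, img1, img2, level) -> scalar loss
    """
    loss = 0.0
    for level, flow in flow_preds.items():
        loss = loss + LEVEL_WEIGHTS[level] * photometric_loss(flow, img1, img2, level)
    return loss

# loss = total_loss(preds, im1, im2, my_photometric_loss)
# loss.backward()
```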

[Screenshot from 2020-01-05 13-07-55: training loss curve]

deqings commented 4 years ago

It seems that you are doing unsupervised training, i.e., using a photometric loss. The loss described in the paper is for the supervised setting. To compute the photometric loss, you need to scale the output flow at each level to the proper scale. Please check the paper for details (regarding scaling the flow before warping).
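For example, here is a rough sketch of a photometric loss at one pyramid level with the flow scaled before warping. This is not the exact code in this repo: the 20.0 factor and the 1/2^l resolution per level follow the convention described in the paper (the ground-truth flow is divided by 20 during training), and `warp` is a generic backward-warping helper.

```python
import torch
import torch.nn.functional as F

def warp(img, flow):
    """Backward-warp img with flow (both flow channels in pixel units)."""
    b, _, h, w = img.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    grid = torch.stack((xs, ys), dim=0).float().to(img.device)  # (2, H, W), (x, y) order
    grid = grid.unsqueeze(0) + flow                              # add the displacement
    # Normalize coordinates to [-1, 1] for grid_sample.
    gx = 2.0 * grid[:, 0] / max(w - 1, 1) - 1.0
    gy = 2.0 * grid[:, 1] / max(h - 1, 1) - 1.0
    return F.grid_sample(img, torch.stack((gx, gy), dim=3), align_corners=True)

def photometric_loss(flow_pred, img1, img2, level):
    """L1 photometric loss at the given pyramid level.

    flow_pred is the raw network output at this level, i.e. in the scaled
    units used for training (ground-truth flow divided by 20); the output
    at level l is assumed to be at 1 / 2**l of the input resolution.
    """
    scale = 1.0 / 2 ** level
    img1_l = F.interpolate(img1, scale_factor=scale, mode="bilinear", align_corners=False)
    img2_l = F.interpolate(img2, scale_factor=scale, mode="bilinear", align_corners=False)
    # Convert the prediction to pixel units at this resolution before warping.
    flow_px = flow_pred * 20.0 * scale
    img2_warped = warp(img2_l, flow_px)
    return (img1_l - img2_warped).abs().mean()
```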