Open rafikg opened 5 years ago
Whether the loss converges mainly depends on the heat map produced. The main change is to ensure that the weight value outside the bbox is zero.
Thanks @xggIoU, I think if you use the adaptive gaussian raduis from cornernet which mainly calculate the radius based on the size of the bbox, it will never had this problem. However, I have trained my DLA model around 80 epochs and the loss is still around ~4.0!!! I see that you trained few epochs and you got something on the training dataset!
@xggIoU Why the heatmap loss does not converge and what did you change? I am using DLA version and I had the same problem of convergence. I tried with mean_square_loss and focal_loss and still having convergence problem ? Some hints ?