Open Shangyi-Li opened 1 year ago
The variance losses are negative which made the total loss negative for me. Do you get similar results to this?
The variance losses are negative which made the total loss negative for me. Do you get similar results to this?
So it's quite normal that the total loss is negative? Well, negative variance loss is really out of my expectation, so I have never trained more than 15 epochs with MIMO_var_C. I will check the loss curves after training and keep you informed if not bothering you. I am actually not familiar with this area, and I am still curious about why a negative loss works during neural network optimization.
Thanks for replying to me so immediately. Your MIMO is an amazing work in uncertainty evaluation on 3D detection which helps me a lot. I really appreacite that you publish your code.
Yes, it is normal. If you plug the equations into wolfram alpha and enter some reasonable values you can see that the result is most likely negative. You could also just output some values where I implemented the loss functions to check the implementation.
I'm not too sure how this affects the optimization. I remember many papers that had these loss functions for 3D object detection were closed-source. Maybe they found a different way to implement it and have a positive loss?
I tried to train a pointpillar_mimo_var_c model with following changes:
and it rapidly reached negative loss.
The train parameters are default as follow:
Which I believe is exactly the same as the demo code and config files provided. Could you please help me? I wonder what parameters are proper and what did you use in the experiment in IEEE paper relevant. I wish you could spend a while to check this problem.