Closed ZhenhuiTang closed 4 years ago
Besides, i have some puzzle about the result of evaluation:
best regards.
Hi, Nan is common when the learning rate is too large. You are using batch_size=1 so the gradient is noisier and you might need a smaller learning rate. You can try to reduce it and see if you still get Nan. In my experiment, I also found gradient clipping and the cyclical learning rate are helpful.
Does the "car_detection_BEV_AP"is with IOU=0.7? yes. Does the "car_detection_3D_AP"is with IOU=0.7? yes. We do not use AOS (part of the AVOD paper) or the 2D AP (IOU=0.7). Please refer to asharakeh 's implementation for what they mean.
The cyclists and pedestrians are detected by a separated model
Thanks,
Hi, Nan is common when the learning rate is too large. You are using batch_size=1 so the gradient is noisier and you might need a smaller learning rate. You can try to reduce it and see if you still get Nan. In my experiment, I also found gradient clipping and the cyclical learning rate are helpful.
Does the "car_detection_BEV_AP"is with IOU=0.7? yes. Does the "car_detection_3D_AP"is with IOU=0.7? yes. We do not use AOS (part of the AVOD paper) or the 2D AP (IOU=0.7). Please refer to asharakeh 's implementation for what they mean.
The cyclists and pedestrians are detected by a separated model
Thanks,
I'm really grateful.
Hi Weijing, I have some questions during training and evaluation. As shown below, the reg loss get nan from the first step. I just change the training dir path and set GPU and batch are 1 respectively.