Open AdamCuellar opened 4 years ago
I had convergence issues with small batch size and was forced to use AdamAccumulate
. The initial learning rate of 0.1 and the batch size of 128 were already suspicious to me.
The missing metricas are a known issue. They are more or less a hack and do not work with tf-keras and probably not with multi GPU either. I did not have the time to fix the tf 2 training.
See also #14 and #25.
Not necessarily an issue, but the mAP I got from DSOD512 training on VOC 07+12 and testing on 07 was quite low, approximately 0.13.
Only thing I really changed was using Adam instead of AdamAccumulate because it throws an error on tf 2.0. I also used softmax.
Also, metrics don't show during training other than the loss itself.