Open troylhy1991 opened 2 months ago
Yes. Detection requires long time to converge. 12 epoch can obtain a relative good results. 24 epochs for full convergence.
many thx! turns out i was using default lr 1e-4 while my batchsize was only half. but it kind of introduces another confusing part to me: the planning sub task might be too easy to overfit ...... ;(
the planning evaluation is close to what the paper reported after training for 6 epochs. however, the detection scores are very low, is this expected? ;(