Open swc-17 opened 2 weeks ago
without map condition and BEVControlNet, and set all params in UNet trainable.
We never tried to train like this. You may adjust the parameters, like learning rate or batch size, to optimize the training process.
Typically, 100 epochs should be fine in most cases. The release model is trained with 350 epochs to optimize the results for quantitative evaluation.
This issue is stale because it has been open for 7 days with no activity. If you do not have any follow-ups, the issue will be closed soon.
Hi, thanks for your great work. I build a model sharing the same camera/box encoder with Magicdrive, without map condition and BEVControlNet, and set all params in UNet trainable. After trained for 100 epochs, I found the generated images do not match with box condition well, for some boxes, the images do not have foreground objects at the corresponding location. Any suggestions on this? And I wonder how many epochs is the released model trained for? 350e as in the config? Thanks!