Closed heydarshahi closed 7 months ago
Hi,
The training was on the cropped images which is a lot faster. The bottleneck is the hard drive speed so it will be much faster if you have an sdd.
For losses, yes i also observed similar issues, it is mainly because of the camera parameters are quite difficult to estimate.
I suggest to take a look at the joint transformer in our latest paper: https://arxiv.org/abs/2403.16428
It has sota performance and it has code release.
Thanks a lot for the quick response!
Following #5, I noticed my training times are very different from yours. E.g., each epoch takes ~8 hours on the full data with twice your batch size (128) and the default parameters on an A100 GPU.
Thanks! Amin