noahzn / Lite-Mono

[CVPR2023] Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation
MIT License
527 stars 58 forks source link

The edges of the image are foggy and blurry during the training process #134

Closed LLLYLong closed 4 months ago

LLLYLong commented 5 months ago

Thanks to the author for such an excellent job! I encountered a problem in the process of modifying the code for training, the edges of the generated disp are blurred and the whole disp image is not displayed very clearly, I would like to consult the author what could be the reason for this。 disp

noahzn commented 5 months ago

Hi, how many rounds of epochs did you train to get this result? The result looks reasonable to me. Did you load the pre-trained ImageNet model?

LLLYLong commented 5 months ago

Hi, thanks for the reply from the author.

  1. This is the result after I finished the training, loaded with pre-trained weights, and looking at the predicted depth map in the senses is not very clear.

  2. Here is a transcript of my training process, looking at the convergence is not particularly perfect, and I would like to ask the author if this is normal? loss

  3. I've tried to make some modifications to change the decoder, but my training loss has become more jarring, and would like to ask some of the authors what might be wrong, and in which areas might I try to make changes? loss——2

noahzn commented 5 months ago

Hi, the figures are not accurate (see the comment here: https://github.com/noahzn/Lite-Mono/blob/main/trainer.py#L541), and they are only rough references to ensure that your training is correct. The figures look fine to me. However, if you say that you have already loaded pre-trained weights for training, then I am afraid that the depth maps should not be of such low quality. Which weights did you load? What arguments did you use for training?

noahzn commented 4 months ago

I'm now closing this issue as there is no response.