Closed mzy97 closed 3 years ago
Our image-level normalized regression loss only needs to normalize the ground-truth depth. By contrast, MiDaS has to process both the ground-truth and the predicted depth, either by normalization or by least-squares fitting. According to our ablation studies, our loss achieves better performance.
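To make the distinction concrete, here is a minimal NumPy sketch of the three variants mentioned above. It is an illustration under assumptions, not the code from either repository: the function names are hypothetical, the ground-truth normalization uses a plain mean/std (the statistic actually used in the paper may differ), and the "both sides" variant uses a median shift with a mean-absolute-deviation scale.

```python
import numpy as np

EPS = 1e-8  # guards against division by zero on constant depth maps


def normalize_gt(depth):
    # Assumption: plain mean/std normalization of the ground truth.
    return (depth - depth.mean()) / (depth.std() + EPS)


def gt_only_normalized_loss(pred, gt):
    """Normalize only the ground truth; the prediction is used as-is."""
    return np.abs(pred - normalize_gt(gt)).mean()


def median_mad_normalize(depth):
    # Shift by the median, scale by the mean absolute deviation from it.
    shift = np.median(depth)
    scale = np.abs(depth - shift).mean() + EPS
    return (depth - shift) / scale


def both_normalized_loss(pred, gt):
    """Normalize prediction and ground truth before comparing them."""
    return np.abs(median_mad_normalize(pred) - median_mad_normalize(gt)).mean()


def least_squares_aligned_loss(pred, gt):
    """Fit scale s and shift t so that s * pred + t matches gt, then compare."""
    A = np.stack([pred.ravel(), np.ones(pred.size)], axis=1)
    (s, t), *_ = np.linalg.lstsq(A, gt.ravel(), rcond=None)
    return np.abs(s * pred + t - gt).mean()
```

In this sketch the last two losses do not change when the prediction is rescaled or shifted, because the alignment is recomputed from the prediction every time. The first loss instead fixes a single target range from the ground truth alone, so the network output has to land in that range directly.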
OK, do you have any idea why your proposed loss is better than MiDaS's? The paper seems to give only a numerical comparison, without an explanation. Thank you.
The MiDaS loss has to adjust both the ground truth and the prediction to a similar numerical range. I conjecture that this adjustment may not be very stable.
What is the difference between the "image-level normalized regression loss" and the shift- and scale-invariant loss based on the per-sample median (in MiDaS)? And what is the benefit of your proposed loss compared to MiDaS's?