Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
I have a few doubts, if you could kindly clear them
Could you clarify the number of layers in the depth map produced by the MiDaS model and how these different layers relate to the depth predictions?
What is the unit of the depth maps which are produced as output ? (pixel, mm, cm,...)?
How can we transform the output of the MiDaS model, which contains relative depth values, back to the original input resolution? This would help establish a pixel-to-depth map relation between the input image and the corresponding depth map.
I have a few doubts, if you could kindly clear them
Could you clarify the number of layers in the depth map produced by the MiDaS model and how these different layers relate to the depth predictions?
What is the unit of the depth maps which are produced as output ? (pixel, mm, cm,...)?
How can we transform the output of the MiDaS model, which contains relative depth values, back to the original input resolution? This would help establish a pixel-to-depth map relation between the input image and the corresponding depth map.