Open Liuhsinlun opened 4 months ago
I would say this library gives you a perception of depth akin to how elevation of "cards" in UI are calculated. To get real world measurements, you still need intrinsic and extrinsic data like, camera focal length etc used to calculate a transformation matrix to map 2D pixels to real world 3D memeasurements.
An article like this one about intrinsic and extrinsic matrices may explain better
@Liuhsinlun Did you solve this problem? I also want to know if the code depth = model.infer_image(raw_img) refers to the depth Wz in the camera coordinate system.
@tangjunjun966 If u want to obtain the predicted depth values in real-world units (i.e., meters), you can use the model in this folder. Its output represents the straight-line distance from the center of the image to the object being captured.
Hello, I have been researching monocular depth estimation recently. I found this place, and the documentation is very detailed, but I still have a small question that I would like the author to help me with. For each value in the depth matrix (I printed it out), do they represent the estimated real-world distance (in meters) for each pixel?
code : depth = model.infer_image(raw_img) # HxW raw depth map in numpy print("depth: \n", depth)
output :