nianticlabs / simplerecon

[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions
Other
1.28k stars 120 forks source link

I have a quick question #16

Closed Wooho-Moon closed 1 year ago

Wooho-Moon commented 1 year ago

At first, thanks for awesome works! I have a quick question. I already read your paper and am impressed on it.

I have a quick question. According to your paper, model take as input reference image, a set of source image, their intrinsics and relative camera pose. If I use only single image as input during inference, could I get similar resluts( e.g. depth map )? I mean , if i don't have any metadata, and just have only single image, Could I use this model?

mohammed-amr commented 1 year ago

You could try passing passing identities for the source poses and manually add zeros for source view features. The result might be sharp, but won't have a good metric depth estimate.

If this isn't an edge case in a video sequence, and you're just looking for a relative depth map for a single image, I'd encourage you look for a SOTA monodepth model instead. Here's MIDAS: https://github.com/isl-org/MiDaS

Thanks.

Wooho-Moon commented 1 year ago

Thanks for reply

mohammed-amr commented 1 year ago

Welcome, hope it helped!