kxhit / EscherNet

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
https://kxhit.github.io/EscherNet
Other
299 stars 16 forks source link

Question about equation 2 #24

Open supersyz opened 2 days ago

supersyz commented 2 days ago

Thanks for your nice work! image Previous works like One-2-3-45 mentioned that the relative camera pose is related to the absolute camera pose, i.e., the elevation. And if the absolute elevation is not considered, the reconstructed 3D shapes could be terrible. It seems that equation 2 does not take this into consideration. Does it implicitly takes this into account, or it ignores it ? Hope for your reply!

kxhit commented 2 days ago

No, EscherNet is designed to handle any 6DoF/4DoF poses, we don't need to estimate the elevation of the input image.