zju3dv / manhattan_sdf

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral
https://zju3dv.github.io/manhattan_sdf/
Other
491 stars 35 forks source link

Different offset & scale values when performing the camera normalization by myself #20

Closed OasisYang closed 1 year ago

OasisYang commented 1 year ago

Hi, appreciate for releasing the codes!

I attempt to do the camera normalization step according to this instruction, but I cannot obtain the same offset and scale value provided by you. For instance, the provided offset and scale of scene0050_00 are [4.24910, 2.30138, 1.15986] and [0.40543], while the parameters I got are [4.2678, 2.2656, 1.2732] and [0.4167]. I directly borrowed the code from VolSDF with some modifications to fit the format of scannet annotations.

They are slightly different, could you explain where the difference comes from and will it affect the final performance a lot?

Thanks a lot.

ghy0324 commented 1 year ago

Hi! Thanks for your interest! Do you use all frames or use 1/10 frames like us? If you use all frames, maybe that's the reason. And I don't think it will affect the final performance. Small perturbation can be tolerated as long as the requirements listed here are fulfilled. Moreover, we will transform the extracted mesh to original coordinate here, so the evaluation will not be affected by slightly different offset & scale values.

OasisYang commented 1 year ago

Hi, may I ask how to obtain the ground-truth scene mesh file? Seems the mesh file used in this work is different from the one provided by ScanNet.

ghy0324 commented 1 year ago

Hi, may I ask how to obtain the ground-truth scene mesh file? Seems the mesh file used in this work is different from the one provided by ScanNet.

The mesh provided by ScanNet is obtained by fusing all RGBD frames. Since we do not use all frames, we fuse the selected frames to make the evaluation more reasonable. And we recommend you to use open3d for fusion if you want to run on custom data.

LiXinghui-666 commented 1 year ago

Hi, appreciate for releasing the codes!

I attempt to do the camera normalization step according to this instruction, but I cannot obtain the same offset and scale value provided by you. For instance, the provided offset and scale of scene0050_00 are [4.24910, 2.30138, 1.15986] and [0.40543], while the parameters I got are [4.2678, 2.2656, 1.2732] and [0.4167]. I directly borrowed the code from VolSDF with some modifications to fit the format of scannet annotations.

They are slightly different, could you explain where the difference comes from and will it affect the final performance a lot?

Thanks a lot.

Hi, I want to ask how to calculate the offset and scale after the scale_mat is obtained from the original VolSDF code. Thanks a lot.