question about cam_pose and vox_origin

Jiawei-Yao0812 / NDCScene

Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space"

Hi:

I'm trying to run inference on images captured with my own camera. However, I'm a little confused about cam_pose and vox_origin, which are inputs when using the NYUv2 dataset.

I tried to find documentation that discusses them, but I couldn't find any.

Here are my questions:

Where does the cam_pose in the NYUv2 dataset come from, and which world coordinate frame is it expressed in?
What does vox_origin represent, and how is it calculated?
In nyu_dataset.py there is a parameter self.img_D = [0.5538, 6.8243]. What does it represent, and how is it computed?

I'm not very familiar with the NYUv2 dataset, so if you know of any documentation that discusses this, could you share it? If not, could you give a short explanation?

Thank you so much!

question about cam_pose and vox_origin #4