Question about pose and fine-sample

LZL-CS commented 2 years ago

Hi, I am confused about the poses and fine sample, which details as blew:

Q1: I know that poses_homo are camera-to-world poses (homogeneous: N 4 4), while pose_avg_homo are averaged poses (which means the centre of all poses, as well as homogeneous: N 4 4). But I am confused about why have to left-multiply as np.linalg.inv(pose_avg_homo) @ poses_homo, and how can we derivate this formula? https://github.com/ActiveVisionLab/DFNet/blob/45880b7e6230aa278ea0da7a33110bbf396de71c/dataset_loaders/load_7Scenes.py#L194

Hope for your response, thank you!

chenusc11 commented 2 years ago

Hi, both Q1 and Q2 are the same implementations of the original NeRF paper, which I have not made a change.

The purpose of 1st part is to shift the GT camera poses to the center at (0,0,0). I suppose it is done in a way in which c2w@P_centered = P_original
The 2nd part is a little bit hard to explain. It is essentially inverting the CDF by finding pdf positions from the CDF. At a high-level concept, it is trying to generate more sample pts near the object surface area (refer to section 5.2 of NeRF paper).

LZL-CS commented 2 years ago

Hi @chenusc11, thanks for your reply, I will digest the references you provide.

ActiveVisionLab / DFNet