If my understanding is correct, the depth D of a 3D point is its Z value in a camera's local coordinate system. It is different from the ray distance t, which is the distance from the point to the camera center. In fact, D = sin(θ) × t, where θ is the angle between the camera plane and the ray. Therefore, even in an ideal case, the distribution of h(t) and D are not expected to be the same.
However, in section 3.2, I see you minimizing the KL divergence between these two distributions. Wouldn't that cause some problems? Or did I miss something?
I am new to nerf, please forgive me if the answer is trivial.
Hi,
Thank you for the great work.
If my understanding is correct, the depth D of a 3D point is its Z value in a camera's local coordinate system. It is different from the ray distance t, which is the distance from the point to the camera center. In fact, D = sin(θ) × t, where θ is the angle between the camera plane and the ray. Therefore, even in an ideal case, the distribution of h(t) and D are not expected to be the same.
However, in section 3.2, I see you minimizing the KL divergence between these two distributions. Wouldn't that cause some problems? Or did I miss something?
I am new to nerf, please forgive me if the answer is trivial.
Thank you