Closed zhywanna closed 7 months ago
Thanks for your interest! We use camera_cords_grid_2D for the sake of minimal coordinates representation, but theoretically speaking, using camera_cords_grid_3D directly would also work for you.
Thanks for your interest! We use camera_cords_grid_2D for the sake of minimal coordinates representation, but theoretically speaking, using camera_cords_grid_3D directly would also work for you.
Do you mean that using 2D camera coordinates are enough to refine local pose? Or the "local pose" is corresponding to 2D. It's still confusing to me.
And why do we only remove z-axis?
The camera_cords_grid_2D refers to pixel coordinates, which is enough for the network to identify different pixels and map them to local transformations.
Thanks for your great job! During reading your code, I have a problem can't figure out.
https://github.com/rover-xingyu/L2G-NeRF/blob/6fbac3261678cc8791a6834c559b26c04b7b8b7a/model/l2g_nerf.py#L252
Why don't you use
camera_cords_grid_3D
directly, but transfer tocamera_cords_grid_2D
before put inwarp_mlp
? Is that necessary?Thanks again! :)