Closed SrinjaySarkar closed 10 months ago
Hi,
I model the NeRF by tightly bounding it using a 3D box. During ray casting, only points inside the box will be processed by NeRF network. This method is first proposed by neural-scene-graphs. You can check out the ray-box intersection algorithm that doing this thing.
Thanks for the reply and the resources. Did you train your network using KITTI object detection dataset or the KITTI tracking dataset ? Looking forward to the code release.
Hi, I am not trained on the KITTI dataset. I train my network on StyleGAN-generated images, and augment the objects to KITTI or nuscenes dataset. Note that our setting is different from neural-scene-graphs. They are doing a reconstruction task, while Lift3D is an unconditional generation framework.
HI, sorry for the confusion. Thanks for the explanation. Looking forward to your code.
Hi @Len-Li , could you please point out the function that gets the camera pose from the bounding box parameters ? In infer.py
you get the camera pose using the get_campara_blender
function.
Hi @Len-Li , could you please point out the function that gets the camera pose from the bounding box parameters ? In
infer.py
you get the camera pose using theget_campara_blender
function.
Hi, sorry for the late. The camera pose is actually with (x,y,z)=(0,0,0), and rotation is a unit matrix. Since we only model our environment in a single frame, we do not need to model additional camera pose.
Thank you for your work Lift3D. In the supplementary of your paper , you mention that you get the object(car) pose from a 3D bounding box parameter.
Can you please explain or provide some code how you did this ? Thanks.