pmj110119 / RenderOcc

[ICRA 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. (Early version: UniOcc)
447 stars 25 forks source link

Depth supervision experiment #45

Closed rmarcuzzi closed 3 months ago

rmarcuzzi commented 3 months ago

Hi! Thanks for sharing your code and congrats on the nice work! I have a question regarding the ablation study in Table V: analysis of depth supervision.

Thank you in advance!

pmj110119 commented 3 months ago

Hi, in our main experiment, we used lidarseg data to sample rays and obtain their corresponding depth/semantic labels. The experiments in Table V were modified based on the main experiment as follows:

  1. In the first row, ray sampling and semantic supervision were kept unchanged, but depth loss was directly disabled.
  2. In the third row, we additionally sampled some rays and their corresponding depth labels obtained through SurroundDepth into the first row of experiments to compute the depth loss.
rmarcuzzi commented 3 months ago

Hi! Thanks for the quick and complete response, it clarifies the experimental setup. I understand now that you always used the semantic GT and that in the third row, you kept the original "lidar rays" for semantic supervision and added some more (corresponding to the depth labels) for depth supervision. Thanks!