OpenDriveLab / OccNet

[ICCV 2023] OccNet: Scene as Occupancy
https://arxiv.org/abs/2306.02851
Apache License 2.0
562 stars 51 forks source link

Question about the this baseline #38

Open zkyyyyyyy opened 6 months ago

zkyyyyyyy commented 6 months ago

Hi, thanks to your great work.
I found the baseline code of this Challenge based on bevformer you provide does not have alignment operation between prev bev and current bev. In bevformer repository it is defined in "transformer.py". But in "transformerOcc.py" I do not find it. So can I ask why?

8ed535ae1a0db732e5235fac86233a2f
sephyli commented 6 months ago

Hello! The code in this repository builds upon last year's baseline, which deliberately omitted the temporal component of BEVFormer. It appears that the unresolved issue stems from BEVFormer's shift calculations being based on a y-forward coordinate system, while for the occupancy dataset, we utilize the nuScenes ego coordinate system, which is x-forward. Previous contributors likely did not have the time to make the necessary adjustments and opted to remove this feature for convenience.

sephyli commented 6 months ago

We have a fix about the coordinates issue at LaneSegNet repo. You can check it out here.

zkyyyyyyy commented 6 months ago

We have a fix about the coordinates issue at LaneSegNet repo. You can check it out here.

I try to run this code but got no key named "can_bus" in the dict ['img_metas']. Then I go to check the anno .pkl file, actually no 'can_bus' information in it. Can you help me with this issue?

sephyli commented 6 months ago

The OpenOcc dataset doesn't construct a can_bus tensor. You can canstruct one or modify the code as you like.

sephyli commented 6 months ago

Here is a doc for can bus.

Icecream-blue-sky commented 5 months ago

The OpenOcc dataset doesn't construct a can_bus tensor. You can canstruct one or modify the code as you like.

So will the current baseline(pre_bev is None by default) perform worse than the case using and aligning pre_bev ?