carla-simulator / carla

Open-source simulator for autonomous driving research.
http://carla.org
MIT License
11.17k stars 3.6k forks source link

Orthographic Projection camera - Undesirable semantic segmentation of buildings from bird's eye view (occupancy grid map) #5932

Closed s-hd closed 1 year ago

s-hd commented 1 year ago

CARLA version: 0.9.13 Platform/OS: Ubuntu 20.04.4 LTS

Hello,

I would like to use a camera ~4m above the ego_vehicle with orthographic projection.

For this, I found a possibility in the following forum that seemed to work well: https://forum.carla.org/t/orthographic-camera-sensor/579/5 Thereby, the attribute "orthographic" is enabled for the "sensor.camera.semantic_segmentation" among others.

Here, some relevant screenshots of the .diff file:

Screenshot from 2022-11-11 13-24-16

Screenshot from 2022-11-11 13-24-38

Now I notice that the result for buildings unfortunately looks like this: Town02: Screenshot from 2022-11-11 12-31-17

Town10HD: town10HD_Screenshot from 2022-11-11 12-49-01

I would expect the buildings to be semantically segmented as "building" rather than "sidewalk" even from above.

I appreciate any hints.

s-hd commented 1 year ago

Another example:

The Ortho-cam:

Screenshot from 2022-11-11 15-27-05

The Semantic-segmentation-cam from 1000m height:

Screenshot from 2022-11-11 15-27-34

It would be desirable to use an orthographic projection, but to classify the buildings "correctly" (as in the semantic segmentation from 1000m).