GANWANSHUI / SimpleOccupancy

(IEEE TIV) A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving
203 stars 11 forks source link

SimpleOccupancy (IEEE TIV 2024)

Arxiv Paper

IEEE TIV Early Access (recommend to read this one)

Supplement

中文解读

A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving

Wanshui Gan, Ningkai Mo, Hongbin Xu, Naoto Yokoya

Previous used titles, 1) A Simple Framework for 3D Occupancy Estimation in Autonomous Driving, 2) A Simple Attempt for 3D Occupancy Estimation in Autonomous Driving. 谁懂啊,QwQ

News

-- Note that the demos are large, please wait a moment to load them.

Demo

RGB, Depth and Mesh:

Self-supervised learning with SDF (Max depth = 52 m )


Self-supervised learning with Density (Max depth = 52 m)

Sparse occupancy prediction:

Dense occupancy prediction:

Point-level training as the pretrain for 3D semantic occupancy:

Abstract

The task of estimating 3D occupancy from surrounding-view images is an exciting development in the field of autonomous driving, following the success of Bird's Eye View (BEV) perception. This task provides crucial 3D attributes of the driving environment, enhancing the overall understanding and perception of the surrounding space. In this work, we present a simple framework for 3D occupancy estimation, which is a CNN-based framework designed to reveal several key factors for 3D occupancy estimation, such as network design, optimization, and evaluation. In addition, we explore the relationship between 3D occupancy estimation and other related tasks, such as monocular depth estimation and 3D reconstruction, which could advance the study of 3D perception in autonomous driving. For evaluation, we propose a simple sampling strategy to define the metric for occupancy evaluation, which is flexible for current public datasets. Moreover, we establish a benchmark in terms of the depth estimation metric, where we compare our proposed method with monocular depth estimation methods on the DDAD and Nuscenes datasets and achieve competitive performance.

Method

Proposed network:

Occupancy label and metric comparison:

The proposed discrete depth metric and chamfer distance metric comparison:

Acknowledgement

Many thanks to these excellent projects:

Related Projects:

This work was partially supported by JST, FOREST under Grant Number JPMJFR206S. Wanshui Gan would also like to express gratitude for the student scholarship provided by TIER IV.

Bibtex

If you find this repository/work helpful in your research, welcome to cite the paper and give a ⭐.

@article{gan2023simple,
  title={A Simple Attempt for 3D Occupancy Estimation in Autonomous Driving},
  author={Gan, Wanshui and Mo, Ningkai and Xu, Hongbin and Yokoya, Naoto},
  journal={arXiv preprint arXiv:2303.10076},
  year={2023}
}

@article{gan2024comprehensive,
  title={A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving},
  author={Gan, Wanshui and Mo, Ningkai and Xu, Hongbin and Yokoya, Naoto},
  journal={IEEE Transactions on Intelligent Vehicles},
  year={2024},
  publisher={IEEE}
}
@article{gan2024gaussianocc,
  title={GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting},
  author={Gan, Wanshui and Liu, Fang and Xu, Hongbin and Mo, Ningkai and Yokoya, Naoto},
  journal={arXiv preprint arXiv:2408.11447},
  year={2024}
}