PJLab-ADG / LoGoNet

[CVPR2023] LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion
248 stars 17 forks source link
3d-object-detection autonomous-driving cvpr2023 lidar-camera-fusion

LoGoNet

This repo is the official implementation of: LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion.

Paper

Framework

image

News

Algorithm Modules

  detection
  ├── al3d_det
  │   ├── datasets
  │   │   │── DatasetTemplate: the basic class for constructing dataset
  │   │   │── augmentor: different augmentation during training or inference
  │   │   │── processor: processing points into voxel space
  │   │   │── the specific dataset module
  │   ├── models: detection model related modules
  |   |   │── fusion: point cloud and image fusion modules
  │   │   │── image_modules: processing images
  │   │   │── modules: point cloud detector
  │   │   │── ops
  │   ├── utils: the exclusive utils used in detection module
  ├── tools
  │   ├── cfgs
  │   │   │── det_dataset_cfgs
  │   │   │── det_model_cfgs
  │   ├── train/test/visualize scripts  
  ├── data: the path of raw data of different dataset
  ├── output: the path of trained model
  al3d_utils: the shared utils used in every algorithm modules
  docs: the readme docs for LoGoNet

Running

💥 This project relies heavily on Ceph storage. Please refer to your file storage system to modify the file path.

Main results

Performances on Waymo with AP/APH (L2)

*We report average metrics across all results. We provide training / validation configurations, pretrained models for all models in the paper. To access these pretrained models, please send us an email with your name, institute, a screenshot of the the Waymo dataset registration confirmation mail, and your intended usage. Please note that Waymo open dataset is under strict non-commercial license so we are not allowed to share the model with you if it will used for any profit-oriented activities. However, we can provide the logs. Model mAPH_L2 VEH_L2 PED_L2 CYC_L2 Log
LoGoNet-1frame (val) 71.38 71.21/70.71 75.49/69.94 74.53/73.48 log
LoGoNet-3frames (val) 74.86 74.60/74.17 78.62/75.79 75.44/74.61 log
LoGoNet-5frames (val) 75.54 75.84/75.38 78.97/76.33 75.67/74.91 log
LoGoNet-5frames (test) 77.10 79.69/79.30 81.55/78.91 73.89/73.10 Record
LoGoNet_Ens (test) 81.02 82.17/81.72 84.27/81.28 80.93/80.06 Record
LoGoNet_Ens_v2 (test) 81.96 82.75/82.32 84.96/82.10 82.36/81.46 Record

Performances on KITTI with mAP

*We report average metrics across all results. We provide training / validation configurations, pretrained models for all models in the paper. Model Car@40 Ped@40 Cyc@40 Log
LoGoNet (val) 87.13 64.46 79.84 log | weights
LoGoNet (test) 85.87 48.57 73.61 Record

Acknowledgement

We sincerely appreciate the following open-source projects for providing valuable and high-quality codes: