This repo is the official implementation of: LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion.
SemanticKITTI-C
, nuScenes-C
, and WOD-C
. detection
├── al3d_det
│ ├── datasets
│ │ │── DatasetTemplate: the basic class for constructing dataset
│ │ │── augmentor: different augmentation during training or inference
│ │ │── processor: processing points into voxel space
│ │ │── the specific dataset module
│ ├── models: detection model related modules
| | │── fusion: point cloud and image fusion modules
│ │ │── image_modules: processing images
│ │ │── modules: point cloud detector
│ │ │── ops
│ ├── utils: the exclusive utils used in detection module
├── tools
│ ├── cfgs
│ │ │── det_dataset_cfgs
│ │ │── det_model_cfgs
│ ├── train/test/visualize scripts
├── data: the path of raw data of different dataset
├── output: the path of trained model
al3d_utils: the shared utils used in every algorithm modules
docs: the readme docs for LoGoNet
💥 This project relies heavily on Ceph
storage. Please refer to your file storage system to modify the file path.
*We report average metrics across all results. We provide training / validation configurations, pretrained models for all models in the paper. To access these pretrained models, please send us an email with your name, institute, a screenshot of the the Waymo dataset registration confirmation mail, and your intended usage. Please note that Waymo open dataset is under strict non-commercial license so we are not allowed to share the model with you if it will used for any profit-oriented activities. However, we can provide the logs. | Model | mAPH_L2 | VEH_L2 | PED_L2 | CYC_L2 | Log |
---|---|---|---|---|---|---|
LoGoNet-1frame (val) | 71.38 | 71.21/70.71 | 75.49/69.94 | 74.53/73.48 | log | |
LoGoNet-3frames (val) | 74.86 | 74.60/74.17 | 78.62/75.79 | 75.44/74.61 | log | |
LoGoNet-5frames (val) | 75.54 | 75.84/75.38 | 78.97/76.33 | 75.67/74.91 | log | |
LoGoNet-5frames (test) | 77.10 | 79.69/79.30 | 81.55/78.91 | 73.89/73.10 | Record | |
LoGoNet_Ens (test) | 81.02 | 82.17/81.72 | 84.27/81.28 | 80.93/80.06 | Record | |
LoGoNet_Ens_v2 (test) | 81.96 | 82.75/82.32 | 84.96/82.10 | 82.36/81.46 | Record |
*We report average metrics across all results. We provide training / validation configurations, pretrained models for all models in the paper. | Model | Car@40 | Ped@40 | Cyc@40 | Log |
---|---|---|---|---|---|
LoGoNet (val) | 87.13 | 64.46 | 79.84 | log | weights | |
LoGoNet (test) | 85.87 | 48.57 | 73.61 | Record |
We sincerely appreciate the following open-source projects for providing valuable and high-quality codes:
If you find our paper useful, please kindly cite us via:
@inproceedings{logonet,
title={LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion},
author={Xin Li and Tao Ma and Yuenan Hou and Botian Shi and Yuchen Yang and Youquan Liu and Xingjiao Wu and Qin Chen and Yikang Li and Yu Qiao and Liang He},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2023}
}
lixin@pjlab.org.cn
and shibotian@pjlab.org.cn
.