Cc-Hy / CMKD

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)
Apache License 2.0

CMKD: Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)

Paper

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (arXiv, Supplementary Included)

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV Open Access)

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (Waymo Challenge Report)

If you find our work helpful for your research, please cite our paper:

@inproceedings{yuhong-CMKD-ECCV2022,
  author    = {Yu Hong and Hang Dai and Yong Ding},
  title     = {Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection},
  booktitle = {{ECCV}},
  series    = {Lecture Notes in Computer Science},
  publisher = {Springer},
  year      = {2022}
}

Introduction

This is the official implementation of CMKD, built on OpenPCDet, for the KITTI and Waymo datasets.

We have also implemented another version with MMDetection3D for the nuScenes dataset.

News

[2023.3.25] We have several updates.

[2023.2.14] We have several updates.

Notice: Because of the tight schedule, instructions and pre-trained models will be released gradually. The code may still contain bugs, so please let us know if you run into issues or have any questions.

[2022.11.20] We release some instructions and pre-trained models covering the KITTI experiments.

This implementation differs from our paper in some details, but the core idea is the same. Overall, the current version trains faster, uses less memory, and performs similarly to the older version.

The Waymo and nuScenes experiments are on the way.

[2022.7.9] Our paper has been accepted by ECCV 2022 as an Oral presentation. :fire::fire::fire:

[2022.7.4] Our paper has been accepted by ECCV 2022. :fire::fire:

[2022.5.24] CMKD took 3rd place in the 2022 Waymo 3D camera-only detection challenge. :fire:

For the challenge, we simply extended our baseline model from a single-camera version to a multi-camera version, without any challenge-specific tricks, and still achieved good results. Specifically, we ranked 3rd using a lightweight ResNet-50 backbone and 20% of the total training samples, with no previous frames, no data augmentation, and no training or testing tricks.

Framework Overview

[Figure: CMKD framework overview]

BEV Features Generation

[Figure: BEV features generation]

Use CMKD

Installation

Please follow INSTALL to install CMKD.

Getting Started

Please follow GETTING_START to train or evaluate the models.

Models

KITTI

| Student (training data) | Teacher Model | Car Easy@R40 | Car Moderate@R40 | Car Hard@R40 | Model | Teacher Model |
|---|---|---|---|---|---|---|
| CMKD-R50 (kitti train + eigen clean) | SECOND | 33.36 | 21.61 | 17.97 | model | model |
| CMKD-R50 (kitti train) | SECOND | 24.02 | 15.80 | 13.22 | model | model |
| CMKD-R50 (kitti train + eigen clean) | CenterPoint | 29.78 | 21.17 | 18.41 | model | model |
| CMKD-R50 (kitti train) | CenterPoint | 22.56 | 16.02 | 13.52 | model | model |
| CMKD-R50 (kitti train + eigen clean) | PointPillar | 32.25 | 21.47 | 18.21 | model | model |
| CMKD-R50 (kitti train) | PointPillar | 23.84 | 16.44 | 13.58 | model | model |
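
To make the effect of the extra training data easier to read, the short sketch below computes, for each teacher, how much Car AP changes between the "kitti train" row and the "kitti train + eigen clean" row. The numbers are copied from the KITTI table above; the dictionary layout and the function name `eigen_clean_gain` are illustrative and not part of the CMKD codebase.

```python
# Car AP@R40 values copied from the KITTI table above.
# Key: (teacher, training data) -> (easy, moderate, hard)
KITTI_AP = {
    ("SECOND", "kitti train + eigen clean"): (33.36, 21.61, 17.97),
    ("SECOND", "kitti train"): (24.02, 15.80, 13.22),
    ("CenterPoint", "kitti train + eigen clean"): (29.78, 21.17, 18.41),
    ("CenterPoint", "kitti train"): (22.56, 16.02, 13.52),
    ("PointPillar", "kitti train + eigen clean"): (32.25, 21.47, 18.21),
    ("PointPillar", "kitti train"): (23.84, 16.44, 13.58),
}


def eigen_clean_gain(teacher):
    """Return the (easy, moderate, hard) AP gain from adding eigen clean.

    Hypothetical helper for reading the table; not a CMKD API.
    """
    full = KITTI_AP[(teacher, "kitti train + eigen clean")]
    base = KITTI_AP[(teacher, "kitti train")]
    return tuple(round(f - b, 2) for f, b in zip(full, base))


if __name__ == "__main__":
    for teacher in ("SECOND", "CenterPoint", "PointPillar"):
        easy, moderate, hard = eigen_clean_gain(teacher)
        print(f"{teacher:12s} easy +{easy:.2f}  moderate +{moderate:.2f}  hard +{hard:.2f}")
```

For every teacher in the table, the row trained with eigen clean scores higher at all three difficulty levels; at the Moderate level the difference is roughly 5 to 6 AP points.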

Waymo

Coming Soon

nuScenes

| Method | mAP | NDS | Model |
|---|---|---|---|
| BEVDet-R50 | 30.7 | 38.2 | - |
| BEVDet-R50 + CMKD | 34.7 | 42.6 | - |
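
As a quick reading aid for the two rows above, the following sketch computes the absolute and relative change in mAP and NDS when CMKD is added to BEVDet-R50. The values are taken directly from the table; the dictionary and the `improvement` helper are illustrative names, not part of the released code.

```python
# mAP and NDS values copied from the two table rows above.
NUSCENES = {
    "BEVDet-R50": {"mAP": 30.7, "NDS": 38.2},
    "BEVDet-R50 + CMKD": {"mAP": 34.7, "NDS": 42.6},
}


def improvement(metric):
    """Return (absolute gain in points, relative gain in percent).

    Hypothetical helper for reading the table; not a CMKD API.
    """
    base = NUSCENES["BEVDet-R50"][metric]
    distilled = NUSCENES["BEVDet-R50 + CMKD"][metric]
    absolute = round(distilled - base, 1)
    relative = round(100.0 * (distilled - base) / base, 1)
    return absolute, relative


if __name__ == "__main__":
    for metric in ("mAP", "NDS"):
        absolute, relative = improvement(metric)
        print(f"{metric}: +{absolute:.1f} points ({relative:.1f}% relative)")
```

With the reported numbers this works out to +4.0 mAP and +4.4 NDS over the BEVDet-R50 baseline.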