The official implementation of aLRP Loss. Our implementation is based on mmdetection. You can also find a different implementation based on official AP Loss repository in this link.
aLRP Loss: A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection,
Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan, NeurIPS 2020. (arXiv pre-print)
Average Localisation-Recall-Precision (aLRP) Loss is a ranking-based loss function to train object detectors by unifying localisation and classification branches. We define aLRP Loss as the average Localisation Recall Precision [1] errors on positive examples. To tackle the nondifferentiable nature of ranking during backpropagation, we combine error-driven update of perceptron learning with backpropogation by generalizing the training approach of AP Loss [2] to ranking-based loss functions (see Section 4 in the paper for details).
With this formulation, aLRP Loss (i) enforces the predictions with large confidence scores to have better localisation, and correlates the classification and localisation tasks (see Figure below), (ii) has significantly less number of hyperparameters (i.e. only 1 hyperparameter) than the conventional loss formulation (i.e. the combination of classification and regression losses by a scaler weight), and (iii) guarantees balanced training (see Theorem 2 in the paper).
Please cite the paper if you benefit from our paper or repository:
@inproceedings{aLRPLoss,
title = {A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection},
author = {Kemal Oksuz and Baris Can Cam and Emre Akbas and Sinan Kalkan},
booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
year = {2020}
}
Method | Backbone | Scale | AP (test-dev) | AP (minival) | Model | Log |
---|---|---|---|---|---|---|
AP Loss* | ResNet-50 | 500 | 35.7 | 35.4 | model | log |
aLRP Loss (GIoU)* | ResNet-50 | 500 | 39.5 | 39.0 | model | log |
aLRP Loss (GIoU+ATSS) | ResNet-50 | 500 | 41.3 | 41.0 | model | log |
aLRP Loss (GIoU+ATSS) | ResNet-101 | 500 | 42.8 | 42.2 | model | log |
aLRP Loss (GIoU+ATSS) | ResNext-101-64x4d | 500 | 44.6 | 44.5 | model | log |
aLRP Loss (GIoU+ATSS) | ResNet-101 | 800 | 45.9 | 45.4 | model | log |
aLRP Loss (GIoU+ATSS) | ResNext-101-64x4d | 800 | 47.8 | 47.2 | model | log |
aLRP Loss (GIoU+ATSS) | ResNext-101-64x4d-DCN | 800 | 48.9 | 48.6 | model | log |
*Following the learning rate scheduling adopted by AP Loss[2], these models are trained for 100 epochs by decreasing learning rate in 60th and 80th epochs. The rest of the models are trained for 100 epochs by scheduling the learning rate in 75th and 95th epochs.
Method | Backbone | AP (minival) | oLRP (minival) | Model | Log |
---|---|---|---|---|---|
Focal Loss+Smooth L1 | ResNet-50 | 38.3 | 68.8 | model | log |
AP Loss+Smooth L1 | ResNet-50 | 36.5 | 69.8 | model | log |
aLRP Loss | ResNet-50 | 39.7 | 67.2 | model | log |
Method | Backbone | AP (minival) | oLRP (minival) | Model | Log |
---|---|---|---|---|---|
Cross Entropy+Smooth L1 | ResNet-50 | 37.8 | 69.3 | model | log |
Cross Entropy+GIoU Loss | ResNet-50 | 38.2 | 69.0 | model | log |
aLRP Loss | ResNet-50 | 40.7 | 66.7 | model | log |
The configuration files of all models listed above can be found in the configs/alrp_loss
folder. You can follow getting_started.md for training code. As an example, to train aLRP Loss (GIoU+ATSS) on 4 GPUs as we did, use the following command:
./tools/dist_train.sh configs/alrp_loss/alrp_loss_retinanet_r50_fpn_ATSS_100e_coco500.py 4
The configuration files of all models listed above can be found in the configs/alrp_loss
folder. You can follow getting_started.md for test code. As an example, to test aLRP Loss (GIoU+ATSS), first download or train a model, then use the following command to test on multiple GPUs:
./tools/dist_test.sh configs/alrp_loss/alrp_loss_retinanet_r50_fpn_ATSS_100e_coco500.py -PATH-TO-TRAINED-MODEL 4 --eval bbox
You can also test a model on a single GPU with the following example command:
python tools/test.py configs/alrp_loss/alrp_loss_retinanet_r50_fpn_ATSS_100e_coco500.py -PATH-TO-TRAINED-MODEL 4 --eval bbox
Following MMDetection, this project is released under the Apache 2.0 license.
[1] Oksuz K, Cam BC, Akbas E, Kalkan S, Localization recall precision (LRP): A new performance metric for object detection, ECCV 2018.
[2] Chen K, Li J, Lin W, See J, Wang J, Duan L, Chen Z, He C, Zou J, Towards Accurate One-Stage Object Detection With AP-Loss, CVPR 2019 & TPAMI.
This repo is maintained by Kemal Oksuz and Baris Can Cam.