Our solution is the paradigm of tracking-by-detection, which allows us to improve detection and tracking separately and facilitate engineering practices.
The architecture can be divided into two parts: the detector and the tracker. We choose PP-YOLOE+ (L) as the detector and BoT-SORT (w/o ReID) as the tracker, and they are practical approaches in the field.
The CTMC dataset[^CTMC-v1] is divided into the first 75% (training set) and the second 25% (validation set).
To use this data for the detector, we sample different cells at certain intervals to reduce the number of redundant images. Finally, a subset of these image sequences was used to produce an object detection dataset in VOC/COCO format, yielding a training set of 6910 images and a validation set of 2280 images.
When using this data for the tracker, this data is not needed for training and is only used for tracker evaluation since we are not using ReID. At this time, the detector will predict all images in the latter 25% of CTMC and will not sample frames as before.
The "+" in PP-YOLOE+ means industrial practical upgrade, which includes the following three improvements:
Visualization of training logs: VisualDL.
This is the evaluation result of our detector on the validation set.
Method (pretrained, epochs) | AP | AP50 | AP75 |
---|---|---|---|
PP-YOLOE (IMAGENET, 300) | 41.6 | 85.7 | 34.2 |
PP-YOLOE+ (COCO, 80) | 41.8 (+0.2) | 85.9 (+0.2) | 33.7 (-0.5) |
PP-YOLOE+ (Objects365, 80) | 42.2 (+0.8) | 86.5 (+0.8) | 35.0 (+0.8) |
The flow of the method is shown in the figure below. Where the detector is replaced with our PP-YOLOE+. And the ReID module was not added considering the similarity of cell morphology and appearance.
The following are the evaluation results of the related method on the CTMC validation set. Where the FPS is measured in the model training phase, and no module such as RepVGG is exported for optimization.
Method | MOTA ↑ | IDF1↑ | ID Sw.↓ | FPS↑ |
---|---|---|---|---|
PP-YOLOE & DeepSORT | 70.5% | 78.5% | 608 | 6.90 |
PP-YOLOE & ByteTrack | 70.3% | 79.1% | 445 | 18.23 |
PP-YOLOE+ & BoT-SORT | 71.0% | 81.6% | 324 | 17.39 |
Method | MOTA ↑ | IDF1↑ | ID Sw.↓ | TRA↑ |
---|---|---|---|---|
PPByteTrack | 51.0% | 58.5% | 2010 | 55.16 |
PPBoTSORT | 51.6% (+0.6%) | 62.2% (+3.7%) | 2211 (-201) | 58.28 (+3.12) |
Details about the code of this method are already available in PaddleDetection[^PaddleDetection].
[^CTMC-v1]: CTMC: Cell Tracking with Mitosis Detection Dataset Challenge [^PP-YOLOE]: PP-YOLOE: An evolved version of YOLO [^BoT-SORT]: BoT-SORT: Robust Associations Multi-Pedestrian Tracking [^PaddleDetection]: PaddleDetection: BoT-SORT