megvii-research / MOTRv2

[CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors
Other
359 stars 45 forks source link

Do you use detection or track bbox for proposal generation? #34

Closed wenyuqing closed 1 year ago

wenyuqing commented 1 year ago

Hi, I'm a little confused how to generate det_bbox.json for Dancetrack? The first question is what is the yolox weight? Is it obtained by pretraining Bytetrack on Dancetrack dataset? Do you use other datasets to train Bytetrack? The second question is do you use the detection bbox of yolox or use the track bbox of Bytetrack? I am wondering whether track bbox will be better?

zyayoung commented 1 year ago
  1. https://github.com/zyayoung/ByteTrackInference
  2. The yolox weight is from https://github.com/noahcao/OC_SORT/blob/master/docs/MODEL_ZOO.md
  3. we use the detection bbox of yolox for training and inference.
wenyuqing commented 1 year ago

Thanks a lot for your quick reply. I'm still a little bit confused about the yolox weights. Which model exactly is used and have you tried other yolox weights? I previously thought that the yolox weight is from training yolox with ByteTrack tracker on DanceTrack training set, which according to the results in this link you provided in the paper: [https://github.com/DanceTrack/DanceTrack] as you provided in the paper ( 47.7 HOTA) image

But it seems that you use OC-SORT tracker rather than ByteTrack tracker? Do you mean you use the model that have 55.1 HOTA results on DanceTrack test set? image

Have you tried other yolox weights such as the yolox pretrained with ByterTrack tracker(47.7 HOTA)? Will it influence the overall performance a lot?

Looking forward to your reply, thanks!