First of all, thank you for your outstanding contribution.
In this paper, Faster R-CNN is used as the detector, and the RPN outputs are used for quasi-dense sampling and similarity learning, which suggests that RoI features are needed during tracking. Does this mean that external detection results cannot be used for tracking in custom scenes, and that we have to train the entire network, from detector to tracker? Thank you!