Under the current released config(sparse4dv3_temporal_r50_1x8_bs6_256x704.py), _decoupleattn = True, which leads to when apply _self.graphmodel, the _anchorembed and _temp_anchorembed will never been used.
The multi-head-attention only uses instance_feature and temp_instance_feature as inputs.
It seems that the explicit historical anchors is only used to get the __temp_anchorembed however _temp_anchorembed is not used in latter operations, only historical instance feature is used.
May I ask why not use _anchor_embed/temp_anchorembed, does this setting leads to performance drop? Is it a bad idea to collect and aggregate features nearby projected historical detections?
Under the current released config(sparse4dv3_temporal_r50_1x8_bs6_256x704.py), _decoupleattn = True, which leads to when apply _self.graphmodel, the _anchorembed and _temp_anchorembed will never been used.
The multi-head-attention only uses instance_feature and temp_instance_feature as inputs.
It seems that the explicit historical anchors is only used to get the __temp_anchorembed however _temp_anchorembed is not used in latter operations, only historical instance feature is used.
May I ask why not use _anchor_embed/temp_anchorembed, does this setting leads to performance drop? Is it a bad idea to collect and aggregate features nearby projected historical detections?