timmeinhardt / trackformer

Implementation of "TrackFormer: Multi-Object Tracking with Transformers". [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
https://arxiv.org/abs/2101.02702
Apache License 2.0

The setting of `prev_query_embed` in `DeformableTransformer` #54


liuqk3 commented 2 years ago

Hi, thanks for your great work!

I found that the `prev_query_embed` of the track queries in `deformable_transformer.py`

https://github.com/timmeinhardt/trackformer/blob/df70fef0539dc6ebe8ed26bf1ce55dd6e8f87968/src/trackformer/models/deformable_transformer.py#L214

is set to zeros. However, the `query_embed` of the detection queries is learned end-to-end and is in fact the positional embedding. Why did you choose this setting? From the commented lines (lines 215-220), it seems you tried different settings for `prev_tgt` and `prev_query_embed`. Does performance differ much between these settings?
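
For context, the asymmetry I am asking about looks roughly like this (a simplified, runnable sketch following the Deformable DETR convention of splitting one learned embedding into a positional half and a target half; the shapes and stand-in tensors are mine, not the repository's code):

```python
import torch
import torch.nn as nn

hidden_dim, num_queries, num_track_queries = 256, 300, 10

# Detection queries: a single learned embedding, split in the forward pass
# into a positional part (query_embed) and a target part (tgt).
query_embeds = nn.Embedding(num_queries, hidden_dim * 2)
query_embed, tgt = torch.split(query_embeds.weight, hidden_dim, dim=1)

# Track queries: the target is carried over from the previous frame,
# but the positional part is simply set to all zeros.
prev_tgt = torch.randn(num_track_queries, hidden_dim)  # stand-in for previous-frame outputs
prev_query_embed = torch.zeros_like(prev_tgt)
```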

timmeinhardt commented 2 years ago

The `query_embed` is the encoding that allows the decoder to differentiate between the object queries. We add a zero encoding to the track queries since they are already refined and hence more easily distinguishable for the decoder. However, I agree this might not be the ideal solution. We tried learning fixed track query encodings and adding the query output from the previous frame as the encoding, but neither gave better results.
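
To make those alternatives concrete, here is a minimal sketch of the three variants (the helper names are illustrative only, not the repository's API; `prev_hs` stands in for the previous frame's decoder hidden states):

```python
import torch
import torch.nn as nn

hidden_dim, max_track_queries = 256, 100

# Variant used in the repository: zero positional encoding for track queries.
def zero_encoding(prev_tgt):
    return torch.zeros_like(prev_tgt)

# Alternative 1: a separate learned, fixed encoding per track query slot.
track_query_embed = nn.Embedding(max_track_queries, hidden_dim)
def learned_encoding(prev_tgt):
    return track_query_embed.weight[: prev_tgt.size(0)]

# Alternative 2: reuse the previous frame's decoder output as the encoding.
def prev_output_encoding(prev_hs):
    return prev_hs

prev_tgt = torch.randn(10, hidden_dim)
prev_query_embed = zero_encoding(prev_tgt)  # the setting discussed above
```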