SJTU-LuHe / TransVOD

The repository is the code for the paper "End-to-End Video Object Detection with Spatial-Temporal Transformers"
Apache License 2.0
212 stars 28 forks

single train & multi train #39

Open jojolee123 opened 1 year ago

jojolee123 commented 1 year ago

Hello, thank you for your nice work on "TransVOD"!

I have a question: "single train" only trains the first half of the network. Once the output head after the STD has been learned, those weights are fixed and training of the full network begins. Why not train the output head and the temporal network together? Is it because of slow convergence?
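
To make the question concrete, here is a minimal PyTorch sketch of the two-stage schedule as I understand it. The modules `spatial` and `temporal`, the helper `train_steps`, and the toy loss are hypothetical stand-ins, not the actual TransVOD code, and I am assuming the first-stage weights are frozen during the second stage.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical toy stand-ins (NOT the TransVOD modules):
spatial = nn.Linear(8, 8)   # "first half": single-frame part with its output head
temporal = nn.Linear(8, 4)  # "second half": temporal part

def train_steps(params, forward, steps=5):
    """Run a few SGD steps on a toy loss, updating only `params`."""
    opt = torch.optim.SGD(params, lr=0.1)
    for _ in range(steps):
        x = torch.randn(16, 8)           # random toy input
        loss = forward(x).pow(2).mean()  # toy loss, for illustration only
        opt.zero_grad()
        loss.backward()
        opt.step()

# Stage 1 ("single train"): only the first half is trained.
train_steps(spatial.parameters(), spatial)

# Freeze the first half (assumption: this is what "fixed weight" means).
for p in spatial.parameters():
    p.requires_grad_(False)
snapshot = [p.clone() for p in spatial.parameters()]

# Stage 2 ("multi train"): run the full network, update only the temporal part.
train_steps(temporal.parameters(), lambda x: temporal(spatial(x)))

# The frozen first-half weights are unchanged after stage 2.
unchanged = all(torch.equal(a, b) for a, b in zip(snapshot, spatial.parameters()))
```

The joint alternative I am asking about would skip the freezing step and pass the parameters of both modules to a single optimizer from the start.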

Waiting for your reply!