antoyang / TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Apache License 2.0
167 stars 8 forks source link

hyper-parameters change #10

Closed Ryan-Wu-13 closed 1 year ago

Ryan-Wu-13 commented 2 years ago

Thank you for your work! How do you determine the hyper-parameters of epochs=20 and batchsize=16 used for training on the hcstvg2.0 dataset? Does changing these parameters have a big impact on performance? Have you tried experimental results with longer epochs?

antoyang commented 2 years ago

I did all experiments on 16 GPUs (with batch size 1 per GPU) that's why I used batch size 16 on all datasets. For HC-STVG2.0, in initial experiments I tried training for up to 30 epochs but observed a saturation of the results between the last 10 epochs, hence the choice of 20 epochs in later experiments.

Ryan-Wu-13 commented 2 years ago

@antoyang Thank you for your reply, do you have the training log of the model hcstvgv2k4res352.pth, could you provide me with it for reference?

antoyang commented 2 years ago

I did not keep the training log, but I put the test log stats for this model here https://drive.google.com/file/d/1lACP-bBb6mwpv00et7tqEdsWq633QSMv/view