antoyang / TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
Apache License 2.0
167 stars 8 forks source link

Pretrained models' performance doesn't match the result #7

Closed ykxixi closed 1 year ago

ykxixi commented 2 years ago

Hi, I download the checkpoints pretrained on HC-STVG2.0, but the result is: viou:0.3555, viou@0.3: 0.5675, viou@0.5: 0.3000. I also find the loss is larger than 25, and the loss of the 0 epoch is almost 58. I have change the stride and resolution to match the checkpoints' training configuration. Did I miss something?

antoyang commented 2 years ago

Hi, I have just reevaluated the checkpoint for HC-STVG2.0 res=352 stride=4 with the command 'python -m torch.distributed.launch --nproc_per_node=8 --use_env main.py --eval --ema --combine_datasets=hcstvg --combine_datasets_val=hcstvg --dataset_config config/hcstvg.json --v2 --output-dir=hcstvgv2k4res352 --load --stride=4 --resolution=352' and obtained the reported performance. I have put the log_stats.json here for more details: https://drive.google.com/file/d/1lACP-bBb6mwpv00et7tqEdsWq633QSMv/view?usp=sharing.