Closed glee1228 closed 2 years ago
Hi @glee1228. Thank you very much for your feedback. The major discrepancy between your run and ours that I can spot is that we used a learning rate of 1e-5
for the training of the coarse-grained student. However, in your case, you used the default value (i.e. 1e-4
, used for the training of the fine-grained student). According to our empirical findings, this has a significant impact on the performance; therefore, this might be the reason for the large performance difference. I have also modified the README.md for the use of the correct learning rate value.
Hence, could you please train the network with 1e-5
learning rate? If there is still such significant performance difference, please let me know to look further into it.
Hi again. I have revisited this issue on a clean install on a new machine. A couple more things were different than our run generated the provided pretrained weights. More precisely, the Attention layer was activated with --attention true
, and the teacher used for training was the fg_att_student_iter2
. To this end, running the following script:
python train_student.py --student_type coarse-grained --learning_rate 1e-5 --attention true --teacher fg_att_student_iter2 --experiment_path ~/experiments/DnS_method --trainset_hdf5 ~/features/dns_100k.hdf5
it achieved the following results:
===== FIVR-5K Dataset =====
Queries: 50 videos
Database: 5000 videos
----------------
DSVR mAP: 0.6364
CSVR mAP: 0.6482
ISVR mAP: 0.6062
Also, since the performance on FIVR-5K is quite volatile, I have added a validation option to the training script based on this dataset. See the last bullet in the corresponding README.md section for more details.
Hi, thanks for your work!
The results I obtained and the evaluation results of the paper are different, but I don't know which part I'm missing.
And, Could you share FIVR-200K, DNS-100K Original Videos with me?
My Training execution code is as follows.
python train_student.py --student_type coarse-grained --experiment_path experiments/dns_students --trainset_hdf5 /mldisk/nfs_shared_/dh/datasets/dns_100k.hdf5
My evaluation execution code is as follows.
python evaluation_student.py --student_path /workspace/distill-and-select/experiments/dns_students/model_cg_student.pth --dataset FIVR-5K --dataset_hdf5 /mldisk/nfs_shared_/dh/datasets/fivr_200k.hdf5
As for the parameters of training and evaluation codes, the code provided was used as it was.
The performance evaluation of the coarse-grained student model in the paper is
The performance evaluation of my coarse-grained student model
The text below is the result of my execution.