1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
I am trying to reproduce your results by following the training instructions, i.e., using the command:
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
python _ET_pp_main_deitS.py \
--tag /deit_S \
--epoch 200 \
--seed 0
The script generates the following output (please note that the epochs are wrong!):
`res_list: [96, 96, 160, 160, 160, 160, 192, 192, 224, 224]
bs_list: [512, 512, 512, 512, 512, 512, 256, 256, 256, 256]
up_freq_list: [1, 1, 1, 1, 1, 1, 2, 2, 2, 2]
replay_times_list: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
replay_buffer_size_list: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
save at: output/deit_S
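
To make the mismatch easier to see, here is a minimal sketch that tabulates the printed schedule and compares its length with the requested epoch count. The list values are copied from the log above. The helper `summarize_schedule` is a hypothetical name of mine, not something from the repository. I am also only assuming that each list entry corresponds to one step of the training schedule and that `up_freq` multiplies the batch size (for example, as gradient accumulation); the log itself does not confirm either.

```python
# Schedule lists copied verbatim from the log output above.
res_list = [96, 96, 160, 160, 160, 160, 192, 192, 224, 224]
bs_list = [512, 512, 512, 512, 512, 512, 256, 256, 256, 256]
up_freq_list = [1, 1, 1, 1, 1, 1, 2, 2, 2, 2]

requested_epochs = 200  # value passed on the command line via --epoch


def summarize_schedule(res, bs, up_freq):
    """Return one (res, bs, up_freq, bs * up_freq) tuple per schedule entry.

    Hypothetical helper for inspection only; not part of the repository.
    """
    if not (len(res) == len(bs) == len(up_freq)):
        raise ValueError("schedule lists differ in length")
    return [(r, b, u, b * u) for r, b, u in zip(res, bs, up_freq)]


schedule = summarize_schedule(res_list, bs_list, up_freq_list)
for i, (r, b, u, product) in enumerate(schedule):
    # bs * up_freq is only an "effective batch size" under the assumption
    # that up_freq means gradient accumulation steps.
    print(f"entry {i}: res={r} bs={b} up_freq={u} bs*up_freq={product}")

print(f"{len(schedule)} schedule entries vs. {requested_epochs} requested epochs")
```

The lists have 10 entries, while the command asked for 200 epochs, which is the discrepancy I am asking about.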