What is the retraining strategy for the Autoformer ?

microsoft / Cream

This is a collection of our NAS and Vision Transformer work.

MIT License

1.62k stars 220 forks source link

What is the retraining strategy for the Autoformer ? #161

Open mingkai-zheng opened 1 year ago

mingkai-zheng commented 1 year ago

Hello, I'm pretty curious about the retraining strategy you used in Table 4. (I understand you use the fine-tuning setting by default). For retraining, do you use the same training strategy with DeiT? or the one you presented in Table 3 with 300 epochs?