JianhaoZhan closed this issue 4 years ago
Hi, thank you for your excellent work. In your paper the weight decay is set to 5e-4, but in your code it is 1e-5, which is quite different. Can you tell me which setting is correct, or which one works better? Thanks.
Hi, you should use the settings in the code, i.e., for fine-tuning, weight_decay = 1e-5.
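To give a rough sense of how far apart the two values are, here is a minimal sketch in plain Python. It assumes a vanilla SGD update with L2 weight decay added to the gradient; the optimizer actually used in the repo may differ, and `sgd_step`, the learning rate, and the starting weight are hypothetical values chosen for illustration only.

```python
def sgd_step(w, grad, lr, weight_decay):
    """One plain SGD step with L2 weight decay folded into the gradient.

    Assumption: decay is applied as grad + weight_decay * w, as in vanilla SGD.
    """
    return w - lr * (grad + weight_decay * w)


# Illustrative values, not taken from the paper or the code.
w, grad, lr = 1.0, 0.0, 0.1

for wd in (5e-4, 1e-5):
    # With a zero gradient, the only change to w comes from the decay term.
    shrink = w - sgd_step(w, grad, lr, wd)
    print(f"weight_decay={wd:g}: weight shrinks by {shrink:.1e} per step")
```

Under this assumption the decay term with 5e-4 is 50 times larger than with 1e-5, so the two settings regularize quite differently.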