For finetuning Bart-base with kp20k, could you give away all the parameters for the polynomial decay scheduler you used, specifically, power, num_training_steps? I believe if you release the json file required here, that'd be even better.
Also, could you please let us know how many epochs did you finetune on kp20k? 15 epochs? But I assume you also assume early stopping. If yes, what was the patience value used for early stopping?
Hi,
For finetuning Bart-base with kp20k, could you give away all the parameters for the polynomial decay scheduler you used, specifically,
power
,num_training_steps
? I believe if you release thejson
file required here, that'd be even better.Also, could you please let us know how many
epochs
did you finetune on kp20k? 15 epochs? But I assume you also assume early stopping. If yes, what was thepatience
value used for early stopping?Thanks and keep up your inspiring work!