Closed: NinedayWang closed this issue 2 years ago
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Closing the issue, since no updates were observed. Feel free to re-open it if you need any further assistance.
How are the learning rate and warmup set for the different models (bert, roberta, and macbert, in both base and large sizes)?