Closed LisaWang0306 closed 2 years ago
For example, suppose we are working on the MNLI dataset. We first fine-tune a pre-trained BERT model (e.g., bert-base-uncased) on MNLI, and this fine-tuned model serves as the teacher.
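One possible way to produce such a teacher, sketched here as an assumption rather than this repository's actual procedure, is the `run_glue.py` text-classification example script from Hugging Face `transformers`. The helper name `build_mnli_finetune_command`, the output directory, and the hyperparameter values below are all hypothetical placeholders. The script path depends on where the `transformers` examples are checked out.

```python
import shlex


def build_mnli_finetune_command(
    model_name="bert-base-uncased",
    output_dir="./mnli_teacher",  # hypothetical path for the fine-tuned teacher
    epochs=3,                     # assumed value, not taken from the README
    learning_rate=2e-5,           # assumed value, not taken from the README
    batch_size=32,                # assumed value, not taken from the README
    max_seq_length=128,           # assumed value, not taken from the README
):
    """Build the argument list for the Hugging Face run_glue.py example script.

    This only constructs the command; it does not start training.
    """
    return [
        "python", "run_glue.py",
        "--model_name_or_path", model_name,
        "--task_name", "mnli",
        "--do_train",
        "--do_eval",
        "--max_seq_length", str(max_seq_length),
        "--per_device_train_batch_size", str(batch_size),
        "--learning_rate", str(learning_rate),
        "--num_train_epochs", str(epochs),
        "--output_dir", output_dir,
    ]


if __name__ == "__main__":
    # Print a shell-ready command line to copy into a terminal.
    print(shlex.join(build_mnli_finetune_command()))
```

If training is run this way, the checkpoint written to the output directory would then play the role of the teacher. Whether the hyperparameters match what the authors used is not stated in the quoted README.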
Thanks!
Hi @LisaWang0306, may I ask how you obtained the fine-tuned model mentioned above (the one based on bert-base-uncased)? Many thanks!
In README, you mentioned that: