Open iambestfeeddddd opened 1 year ago
Only analyze some cases and select a threshold, that may be not the best value.
Only analyze some cases and select a threshold, that may be not the best value.
Hmm, I'm a bit confused about how many epochs we need to finetune for unlabeled data. Is 3 a good number or do we need more?
@staoxiao hmm, i have a new question. Do you perform dedup operations on wiki and bookcorpus data? And if there is dedup, I wonder if we should do it after splitting the data into traning sample or dedup with default sample in the raw dataset (before building the dataset).
@staoxiao hmm, i have a new question. Do you perform dedup operations on wiki and bookcorpus data? And if there is dedup, I wonder if we should do it after splitting the data into traning sample or dedup with default sample in the raw dataset (before building the dataset).
No, I didn't perform dedup operation.
There is a paragraph like this on page 4 of your technical report:
I have a bit question as to how you choose the number 0.43. Is there a qualitative method for this selection?