jbrry / Irish-BERT

Repository to store helper scripts for creating an Irish BERT model.
Other
9 stars 0 forks source link

Effect of corpus sampling on continued pre-training #107

Open jowagner opened 2 years ago

jowagner commented 2 years ago

Similarly to issue #85, we should investigate how much the noise is from randomness in the selection (and ordering) of training data in continued pre-training.