jbrry / Irish-BERT

Repository to store helper scripts for creating an Irish BERT model.
Other
9 stars 0 forks source link

Restrict BERT vocabulary building to clean corpora #54

Open jowagner opened 3 years ago

jowagner commented 3 years ago

We should add an experiment where the vocabulary is restricted to our cleanest corpora, e.g. NCI. See also issue #33.