jbrry / Irish-BERT

Repository to store helper scripts for creating an Irish BERT model.
Other
9 stars 0 forks source link

Why does electra trained for 24h not perform well? #81

Open jowagner opened 3 years ago

jowagner commented 3 years ago

Find out why we did not succeed in training a usable electra model with 24 hour computation budget in issue #76 when the 48 hour BERT models perform ok and electra is supposed to reach good performance levels much more quickly.

Reading: