ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
Where to get the exact English and Chinese pretraining data identical to those used in the BERT paper? #118
Open
guotong1988 opened 3 years ago
Does anyone know where to get them? Thank you.