brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
MIT License
9.41k stars 1.54k forks source link

The exact English pretraining data and Chinese pretraining data that are exact same to the BERT paper's pretraining data. #39

Open guotong1988 opened 3 years ago

guotong1988 commented 3 years ago

Any one know where to get them? Thank you and thank you.