facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
MIT License
30.43k stars 6.4k forks source link

The exact English pretraining data and Chinese pretraining data that are exact same to the BERT paper's pretraining data. #3372

Open guotong1988 opened 3 years ago

guotong1988 commented 3 years ago

Any one know where to get them? Thank you and thank you.

stale[bot] commented 3 years ago

This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!