codertimo / BERT-pytorch

Google AI 2018 BERT pytorch implementation
Apache License 2.0
6.11k stars 1.29k forks source link

made a script to generate bert pre-train data #50

Open fucusy opened 5 years ago

fucusy commented 5 years ago

the script is similar to https://github.com/google-research/bert/blob/master/create_pretraining_data.py from google-research. it can convert a document into bert trainning data