dhlee347 / pytorchic-bert

Pytorch Implementation of Google BERT
Apache License 2.0

Usage #25

Open JingsenZhang opened 4 years ago

JingsenZhang commented 4 years ago

Hello, I have the following questions about the usage part:

1. How do I get the following two files? [image]

2. For the Toronto Book Corpus, should I download it and manually adjust the format?

Looking forward to your reply, and thank you very much!

dhlee347 commented 4 years ago
  1. You can get vocab.txt from Google's pretrained BERT model release, and model_steps_345.pt by fine-tuning your own model.
  2. The Toronto Book Corpus is no longer available. You can use your own text data instead, formatted as described in the comment in pretrain.py.
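
For preparing your own text data, here is a minimal sketch. It assumes the layout that Google's BERT pretraining scripts expect: one sentence per line, with a blank line between documents. Please check the comment in pretrain.py to confirm this repo uses the same layout. The function name `write_pretrain_corpus` and the regex-based sentence split are hypothetical illustrations, not part of pytorchic-bert.

```python
import re
from pathlib import Path


def write_pretrain_corpus(documents, out_path):
    """Write documents as one sentence per line, blank line between documents.

    ASSUMPTION: this layout follows Google BERT's pretraining input format;
    verify it against the comment in pretrain.py before training.
    """
    blocks = []
    for doc in documents:
        # Naive split after '.', '!' or '?' followed by whitespace.
        # For real data, a proper sentence tokenizer is a better choice.
        sentences = [
            s.strip()
            for s in re.split(r"(?<=[.!?])\s+", doc.strip())
            if s.strip()
        ]
        if sentences:  # skip empty documents
            blocks.append("\n".join(sentences))
    # Documents are separated by exactly one blank line.
    Path(out_path).write_text("\n\n".join(blocks) + "\n", encoding="utf-8")
    return len(blocks)
```

The resulting file can then be used as the training data for pretrain.py in place of the Toronto Book Corpus.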